WORLD INTCLLECTUAL PROPERXy ORGANIZATION 
bitematioDal Bureau 




PCX 

INTERNATIONAL APPUCATION PUBUSHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) Intematioiial Patent Classification ^ : 

C12N 15W3, 15/10, lS/62, 15/70, 1/21, 
C07K 1/04, GOIN 33/53 



Al 



(11) International Publication Numbo-: WO 97/08320 

(43) International Publication Date: 6 March 1997 (06.03.97) 



(21) International Application Number: PCT/EP96/03647 

(22) International Filing Date: 19 August 1996 (19.08.96) 



(30) Priority Data: 

95113021.0 18 August 1995 (18.08.95) EP 

(34) Countries for which the regional or 

international application was filed: DE et al. 



(71) Applicant (for all designated States except US): MORPHOSYS 
GESELLSCHAFT FOR PROIEINOPTIMIERUNG MBH 
PE/DE]; Frankfurter Ring 193a, D-80807 Mtochen (DE). 



(81) Designated States: AU, CA, JP. US. European patent (AT, BE. 
OH, DE, DK. ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, 
PT, SE). 



Published 

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(72) Inventors; and 

(75) Inventors/AppHcants (for US only): KNAPPIK, Achim 
[DE/DE]; Killersttasse 16, D-82166 GrSfelfing (DE), 
PACK, Peter [DE/DE]; Franz-Wolter-Strasse 4, D-81925 
MOnchen (DE). ILAG, Vic [PH/DEJ; Knoirstrasse 85, 
D-80807 Mttnchen (DE). GE. Liming [CN/DE]; Ne- 
stroystrasse 17, D-81373 MQnchen (DE). MORONEY, 
Simon [N2/DE]; Osterwaldstrasse 44, D-80805 MQnchen 
(DE). PLCCKTHUN, Andreas [DE/CH]; MOhrlistrasse 97, 
CH-8006 Zurich (CH). 

(74) Agent: VOSSIUS & PARTNER; P.O. Box 86 07 67, D-81634 
Mtinchen (DE). 



(54)TiUe: PROTEIN/(POLY)PEPTIDE LIBRARIES 



(57) Abstract 

The present invention relates to synthetic DNA sequences which 
encode one or more collections of homologous proteins/(poly)peptidcs, and 
methods for generating and applying libraries of these DNA sequences. In 
particular, the invention relates to the preparation of a library of human- 
derived antibody genes by the use of synthetic cons^us sequences which 
cover the structural lepertoire of antibodies encoded in the human genome. 
Furthennore, the mvention relates to the use of a single consensus antibody 
gene as a universal framework for highly diverse antibody libraries. 
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Protem/(Poly)peptide Libraries 



Field of the Invention 

The present invention relates to synthetic DNA sequences which encode one or 
more collections of homologous proteins/(poly)peptides, and methods for 
generating and applying libraries of these DNA sequences. In particular, the 
invention relates to the preparation of a library of human-derived antibody genes by 
the use of synthetic consensus sequences which cover the stnjctural repertoire of 
antibodies encoded in the human genome. Furthermore, the invention relates to the 
use of a single consensus antibody gene as a universal framework for highly 
diverse antibody libraries. 

Background to the Invention 

All current recombinant methods which use libraries of proteins/(po!y)peptides, e.g. 
antibodies, to screen for members with desired properties, e.g. binding a given 
ligand, do not provide the possibility to improve the desired properties of the 
members in an easy and rapid manner. Usually a library is created either by 
inserting a random oligonucleotide sequence into one or more DNA sequences 
cloned from an organism, or a family of DNA sequences is cloned and used as the 
library. The library is then screened, e.g. using phage display, for members which 
show the desired property. The sequences of one or more of these resulting 
molecules are then determined. There is no general procedure available to improve 
these molecules further on. 

Winter (EP 0 368 684 B1) has provided a method for amplifying {by PGR), cloning, 
and expressing antibody variable region genes. Starting with these genes he was 
able to create libraries of functional antibody fragments by randomizing the CDR3 of 
the heavy and/or the light chain. This process is functionally equivalent to the natural 
process of VJ and VDJ recombination which occurs during the development of B- 
cells in the immune system. 

However the Winter invention does not provide a method for optimizing the binding 
affinities of antibody fragments further on, a process which would be functionally 
equivalent to the naturally occurring phenomenon of "affinity maturation", which is 
provided by the present invention. Furthermore, the Winter invention does not 
provide for artificial variable region genes, which represent a whole family of 
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Structurally similar natural genes, and which can be assemblsd from synthetic DNA 
oligonucleotides. Additionally. Winter does not enable the combinatorial assembly 
of portions of antibody variable regions, a feature which is provided by the present 
invention. Furthermore, this approach has the disadvantage that the genes of all 
antibodies obtained in the screening procedure have to be completely sequenced, 
since, except for the PGR priming regions, no additional sequence information about 
the library members is available. This is time and labor intensive and potentially 
leads to sequencing errors. 

The teaching of Winter as well as other approaches have tried to create large 
antibody libraries having high diversity in the complementarity determining regions 
(CDRs) as well as in the frameworks to be able to find antibodies against as -many 
different antigens as possible. It has been suggested that a single universal 
framework may be useful to build antibody libraries, but no approach has yet been 
successful. 

Another problem lies in the production of reagents derived from antibodies. Small 
antibody fragments show exciting promise for use as therapeutic agents, diagnostic 
reagents, and for biochemical research. Thus, they are needed in large amounts, 
and the expression of antibody fragments, e.g. Fv, single-chain Fv (scFv). or Fab in 
the periplasm of E. coli (Skerra & Pluckthun. 1988; Better et al., 1988) is now used 
routinely in many laboratories. Expression yields vary widely, however. While some 
fragments yield up to several mg of functional, soluble protein per liter and OD of 
culture broth in shake flask culture (Carter et al., 1992, Pluckthun et al. 1996), other 
fragments may almost exclusively lead to insoluble material, often found in so-called 
inclusion bodies. Functional protein may be obtained from the latter in modest yields 
by a laborious and time-consuming refolding process. The factors influencing 
antibody expression levels are still only poorly understood. Folding efficiency and 
stability of the antibody fragments, protease lability and toxicity of the expressed 
proteins to the host cells often severely limit actual production levels, and several 
attempts have been tried to increase expression yields. For example, Knappik & 
Pluckthun (1995) could show that expression yield depends on the antibody 
sequence. They identified key residues in the antibody framework which influence 
expression yields dramatically. Similarly, Ullrich et al. (1995) found that point 
mutations in the CDRs can increase the yields in periplasmic antibody fragment 
expression. Nevertheless, these strategies are only applicable to a few antibodies. 
Since the Winter invention uses existing repertoires of antibodies, no influence on 
expressibility of the genes is possible. 
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Furthermore, the findings of Knappik & Pluckthun and Ullrich demonstrate that the 
knowledge about antibodies, especially about folding and expression is still 
increasing. The Winter invention does not allow to incorporate such improvements 

into the library design. 

The expressibility of the genes is important for the library quality as well, since the 
screening procedure relies in most cases on the display of the gene product on a 
phage surface, and efficient display relies on at least moderate expression of the 
gene. 

These disadvantages of the existing methodologies are overcome by the present 
invention, which is applicable for all collections of homologous proteins. It has the 
following novel and useful features illustrated in the following by antibodies as an 
example: 

Artificial antibodies and fragments thereof can be constructed based on known 
antibody sequences, which reflect the structural properties of a whole group o 
homologous antibody genes. Therefore it is possible to reduce the number of 
different genes without any loss in the structural repertoire. This approach leads to a 
limited set of artificial genes, which can be synthesized de novo, thereby allowing 
introduction of cleavage sites and removing unwanted cleavages sites. Furthermore, 
this approach enables (i). adapting the codon usage of the genes to that of highly 
expressed genes in any desired host cell and (ii). analyzing all possible pairs of 
antibody light (L) and heavy (H) chains in terms of interaction preference, antigen 
preference or recombinant expression titer, which is virtually impossible using the 
complete collection of antibody genes of an organism and all combinations thereof. 

The use of a limited set of completely synthetic genes makes it possible to create 
cleavage sites at the boundaries of encoded structural sub-elements. Therefore, 
each gene is built up from modules which represent structural sub-elements on the 
protein/(poly)peptide level. In the case of antibodies, the modules consist of 
"framework" and "CDR" modules. By creating separate framework and CDH 
modules, different combinatorial assembly possibilities are enabled. Moreover, if 
two or more artificial genes carry identical pairs of cleavage sites at the boundaries 
of each of the genetic sub-elements, pre-built libraries of sub-elements can be 
inserted in these genes simultaneously, without any additional information related to 
any particular gene sequence. This strategy enables rapid optimization of, for 
example antibody affinity, since DNA cassettes encoding libraries of genetic sub- 
elements can be (i). pre-built. stored and reused and (ii). inserted in any of these 
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sequences at the right position without knowing ths actual sequence or having to 
determine the sequence of the individual library member. 

Additionally, new information about amino acid residues important for binding, 
stability, or solubility and expression could be integrated into the library design by 
replacing existing modules with modules modified according to the new 
observations. 

The limited number of consensus sequences used for creating the library allows to 
speed up the identification of binding antibodies after screening. After having 
identified the underlying consensus gene sequence, which could be done by 
sequencing or by using fingerprint restriction sites, just those part(s) comprising the 
random sequence(s) have to be determined. This reduces the probability of 
sequencing errors and of false-positive results. 

The above mentioned cleavage sites can be used only if they are unique in the 
vector system where the artificial genes have been inserted. As a result, the vector 
has to be modified to contain none of these cleavage sites. The construction of a 
vector consisting of basic elements like resistance gene and origin of replication, 
where cleavage sites have been removed, is of general interest for many cloning 
attempts. Additionally, these vector(s) could be part of a kit comprising the above 
mentioned artificial genes and pre-built libraries. 

The collection of artificial genes can be used for a rapid humanization procedure of 
non-human antibodies, preferably of rodent antibodies. First, the amino acid 
sequence of the non-human, preferably rodent antibody is compared with the amino 
acid sequences encoded by the collection of artificial genes to determine the most 
homologous light and heavy framework regions. These genes are then used for 
insertion of the genetic sub-elements encoding the CDRs of the non-human, 
preferably rodent antibody. 

Surprisingly, it has been found that with a combination of only one consensus 
sequence for each of the light and heavy chains of a scFv fragment an antibody 
repertoire could be created yielding antibodies against virtually every antigen. 
Therefore, one aspect of the present invention is the use of a single consensus 
sequence as a universal framework for the creation of useful (poly)peptide libraries 
and antibody consensus sequences useful therefor. 
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The present invention enables the creation of useful libraries of (poly)peptides. In a 
first embodiment, the invention provides for a method of setting up nucleic ac.d 
sequences suitable for the creation of said libraries. In a first step, a collection of at 
least three homologous proteins is identified and then analyzed. Therefore, a 
database of the protein sequences is established where the protein sequences are 
aligned to each other. The database is used to define subgroups of protein 
sequences which show a high degree of similarity in both the sequence and. if 
information is available, in the structural arrangement. For each of the subgroups a 
(poly)peptide sequence comprising at least one consensus sequence is deduced 
which represents the members of this subgroup; the complete collection of 
(poly)peptide sequences represent therefore the complete structural repertoire of 
the collection of homologous proteins. These artificial (poly)peptide sequences are 
then analyzed, if possible, according to their structural properties to identify 
unfavorable interactions between amino acids within said (poly)peptide sequences 
or between said or other (poly)peptide sequences, for example, in multimeric 
proteins Such interactions are then removed by changing the consensus sequence 
accordingly. The (poly)peptide sequences are then analyzed to identify sub- 
elements such as domains, loops, helices or CDRs. The amino acid sequence .s 
backtranslated into a corresponding coding nucleic acid sequence which is adapted 
to the codon usage of the host planned for expressing said nucleic acid sequences. 
A set of cleavage sites is set up in a way that each of the sub-sequences encoding 
the sub-elements identified as described above, is flanked by two sites which do not 
occur a second time within the nucleic acid sequence. This can be achieved by 
either identifying a cleavage site already flanking a sub-sequence of by changing 
one or more nucleotides to create the cleavage site, and by removing that site from 
the remaining part of the gene. The cleavage sites should be common to all 
corresponding sub-elements or sub-sequences, thus creating a fully modular 
arrangement of the sub-sequences in the nucleic acid sequence and of the sub- 
elements in the corresponding (poly)peptide. 

In a further embodiment, the invention provides for a method which sets up two or 
more sets of (poly)peptides. where for each set the method as described above is 
performed, and where the cleavage sites are not only unique within each set but 
also between any two sets. This method can be applied for the creation of 
(poly)peptide libraries comprising for example two a-helical domains from two 
different proteins, where said library is screened for novel hetero-associat.on 
domains. 
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In yet a further embodiment, at least two of the sets as described above, are derived 
from the same collection of proteins or at least a part of it. This describes libraries 
comprising for example, but not limited to, two domains from antibodies such as VH 
and VL, or two extracellular loops of transmembrane receptors. 

In another embodiment, the nucleic acid sequences set up as described above, are 
synthesized. This can be achieved by any one of several methods well known to the 
practitioner skilled in the art, for example, by total gene synthesis or by PCR-based 
approaches. 

In one embodiment, the nucleic acid sequences are cloned into a vector. The vector 
could be a sequencing vector, an expression vector or a display (e.g. phage display) 
vector, which are well known to those skilled in the art. Any vector could comprise 
one nucleic acid sequence, or two or more nucleic sequences, either in different or 
the same operon. In the last case, they could either be cloned separately or as 
contiguous sequences. 

In one embodiment, the removal of unfavorable interactions as described above, 
leads to enhanced expression of the modified (poly)peptides. 

In a preferred embodiment, one or more sub-sequences of the nucleic acid 
sequences are replaced by different sequences. This can be achieved by excising 
the sub-sequences using the conditions suitable for cleaving the cleavage sites 
adjacent to or at the end of the sub^sequence, for example, by using a restriction 
enzyme at the corresponding restriction site under the conditions well known to 
those skilled in the art, and replacing the sub-sequence by a different sequence 
compatible with the cleaved nucleic acid sequence. In a further preferred 
embodiment, the different sequences replacing the initial sub-sequence(s) are 
genomic or rearranged genomic sequences, for example in grafting CDRs from non- 
human antibodies onto consensus antibody sequences for rapid humanization of 
non-human antibodies. In the most preferred embodiment, the different sequences 
are random sequences, thus replacing the sub-sequence by a collection of 
sequences to introduce variability and to create a library. The random sequences 
can be assembled in various ways, for example by using a mixture of 
mononucleotides or preferably a mixture of trinucleotides (Virnekas et al.. 1994) 
during automated oligonucleotide synthesis, by error-prone PGR or by other 
methods well known to the practitioner in the art. The random sequences may be 
completely randomized or biased towards or against certain codons according to 
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the amino acid distribution at certain positions in known protein sequences. 
Additionally, the collection of random sub-sequences may comprise different 
numbers of codons. giving rise to a collection of sub-elements having different 
lengths. 

In another embodiment, the invention provides for the expression of the nucleic acid 
sequences from a suitable vector and under suitable conditions well known to those 
skilled in the art. 

In a further preferred embodiment, the (poly)peptides expressed from said nucleic 
acid sequences are screened and, optionally, optimized. Screening may be 
performed by using one of the methods well known to the practitioner in the art, such 
as phage-display. selectively infective phage, polysome technology to screen for 
binding, assay systems for enzymatic activity or protein stability. (Poly)peptides 
having the desired property can be identified by sequencing of the corresponding 
nucleic acid sequence or by amino acid sequencing or mass spectrometry. In the 
case of subsequent optimization, the nucleic acid sequences encoding the initially 
selected (poly)peptides can optionally be used without sequencing. Optimization is 
performed by repeating the replacement of sub-sequences by different sequences, 
preferably by random sequences, and the screening step one or more times. 

The desired property the (poly)peptides are screened for is preferably, but not 
exclusively, selected from the group of optimized affinity or specificity for a target 
molecule, optimized enzymatic activity, optimized expression yields, optimized 
stability and optimized solubility. 

In one embodiment, the cleavage sites flanking the sub-sequences are sites 
recognized and cleaved by restriction enzymes, with recognition and cleavage 
sequences being either identical or different, the restricted sites either having blunt 
or sticky ends. 

The length of the sub-elements is preferably, but not exclusively ranging between 1 
amino acid, such as one residue in the active site of an enzyme or a structure- 
determining residue, and 150 amino acids, as for whole protein domains. Most 
preferably, the length ranges between 3 and 25 amino acids, such as most 
commonly found in CDR loops of antibodies. 

The nucleic acid sequences could be RNA or. preferably. DNA. 
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In one embodiment, the (poly)peptides have an arr.ino acid pattern characteristic of 
a particular species. This can for example be achieved by deducing the consensus 
sequences from a collection of homologous proteins of just one species, most 
preferably from a collection of human proteins. Since the (poly)peptides comprising 
consensus sequences are artificial, they have to be compared to the protein 
sequence(s) having the closest similarity to ensure the presence of said 
characteristic amino acid pattern. 

In one embodiment, the invention provides for the creation of libraries of 
(poly)peptides comprising at least part of members or derivatives of the 
immunoglobulin superfamily. preferably of member or derivatives of the 
immnoglobulins. Most preferably, the invention provides for the creation of libraries 
of human antibodies, wherein said (poly)peptides are or are derived from heavy or 
light chain variable regions wherein said structural sub-elements are framework 
regions (FR) 1, 2, 3. or 4 or complementary determining regions (CDR) 1, 2, or 3. In a 
first step, a database of published antibody sequences of human origin is 
established where the antibody sequences are aligned to each other. The database 
is used to define subgroups of antibody sequences which show a high degree of 
similarity in both the sequence and the canonical fold of CDR loops (as determined 
by analysis of antibody structures). For each of the subgroups a consensus 
sequence is deduced which represents the members of this subgroup; the complete 
collection of consensus sequences represent therefore the complete structural 
repertoire of human antibodies. 

These artificial genes are then constructed e.g. by total gene synthesis or by the use 
of synthetic genetic subunits. These .genetic subunits correspond to structural sub- 
elements on the (poly)peptide level. On the DNA level, these genetic subunits are 
defined by cleavage sites at the start and the end of each of the sub-elements, which 
are unique in the vector system. All genes which are members of the collection of 
consensus sequences are constructed such that they contain a similar pattern of 
corresponding genetic sub-sequences. Mosl preferably, said (poly)peptides are or 
are derived from the HuCAL consensus genes: Vk1, Vk2, Vk3, Vk4. VM. VX2, VX3, 
VH1A, VH1B. VH2. VH3. VH4, VH5, VH6, Ck. CX, CH1 or any combination of said 
HuCAL consensus genes. 

This collection of DNA molecules can then be used to create libraries of antibodies 
or antibody fragments, preferably Fv, disulphide-linked Fv. single-chain Fv (scFv), or 
Fab fragments, which may be used as sources of specificities against new target 
antigens. Moreover, the affinity of the antibodies can be optimized using pre-buiit 
library cassettes and a general procedure. The invention provides a method for 
identifying one or more genes encoding one or more antibody fragments which 

-8- 

suBsnruTE sheet (rule 26) 



PCT/EP96/03647 

WO 97/08320 

binds to a target, comprising the steps of expressing the antibody fragments, and 
then screening them to isolate one or more antibody fragments which bind to a 
given target molecule. Preferably, an scFv fragment library compnsing the 
combination of HuCAL VH3 and HuCAL VX2 consensus genes and at least a 
random sub-sequence encoding the heavy chain CDR3 sub-element is screened for 
binding antibodies. If necessary, the modular design of the genes can then be used 
to excise from the genes encoding the antibody fragments one or more genetic sub- 
sequences encoding structural sub-elements, and replacing them by one or more 
second sub-sequences encoding structural sub-elements. The expression and 
screening steps can then be repeated until an antibody having the desired affinity is 
generated. 

Particularly preferred is a method in which one or more of the genetic subunits (e.g. 
the CDRs) are replaced by a random collection of sequences (the library) using the 
said cleavage sites. Since these cleavage sites are (I) unique in the vector system 
and (ii) common to all consensus genes, the same (pre-built) library can be inserted 
into all artificial antibody genes. The resulting library is then screened against any 
chosen antigen. Binding antibodies are selected, collected and used as starting 
material for the next library. Here, one or more of the remaining genetic subunits are 
randomized as described above. 

A further embodiment of the present invention relates to fusion proteins by providing 
for a DNA sequence which encodes both the (poly)peptide. as described above, as 
well as an additional moiety. Particularly preferred are moieties which have a useful 
therapeutic function. For example, the additional moiety may be a toxin molecule 
which is able to kill cells (Vitetta et al.. 1993). There are numerous examples of such 
toxins, well known to the one skilled in the art. such as the bacterial toxins 
Pseudomonas exotoxin A, and diphtheria toxin, as well as the plant toxins ncin. 
abrin. modeccin. saporin. and gelonin. By fusing such a toxin for example to an 
antibody fragment, the toxin can be targeted to. for example, diseased cells, and 
thereby have a beneficial therapeutic effect. Alternatively, the additional moiety may 
be a cytokine, such as lL-2. (Rosenberg & Lotze. 1 986). which has a particular effect 
(in this case a T-cell proliferative effect) on a family of cells. In a further embodiment, 
the additional moiety may confer on its (poly)peptide partner a means of detection 
and/or purification. For example, the fusion protein could comprise the modified 
antibody fragment and an enzyme commonly used for detection purposes, such as 
alkaline phosphatase (Blake et al., 1984). There are numerous other moieties which 
can be used as detection or purification tags, which are well known to the 
practitioner skilled in the art. Particularly preferred are peptides comprising at least 
five histidine residues (Hochuli et al., 1988), which are able to bind to metal ions, 
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and can therefore be used for the purification of the protein to which they are fused 
(Lindner et al.. 1992). Also provided for by the invention are additional moieties such 
as the commonly used C-myc and FI-AG tags (Hopp et al., 1988; Knappik & 
Pluckthun. 1994). 

By engineering one or more fused additional domains, antibody fragments or any 
other (poly)peptide can be assembled into larger molecules which also fall under 
the scope of the present invention. For example, mini-antibodies (Pack, 1994) are 
dimers comprising two antibody fragments, each fused to a self-associating 
dimerization domain. Dimerization domains which are particularly preferred include 
those derived from a leucine zipper (Pack & Pluckthun, 1992) or helix-turn-helix 
motif (Pack et al.. 1 993). 

All of the above embodiments of the present invention can be effected using 
standard techniques of molecular biology known to anyone skilled in the art. 

In a further embodiment, the random collection of sub-sequences (the library) is 
inserted into a singular nucleic acid sequence encoding one (poly)peptide, thus 
creating a (poly)peptide library based on one universal framework. Preferably a 
random collection of CDR sub-sequences is inserted into a universal antibody 
framework, for example into the HuCAL H3k2 single-chain Fv fragment described 
above. 

In further embodiments, the invention provides for nucleic acid sequence(s). 
vector(s) containing the nucleic acid sequence(s), host cell(s) containing the 
vector(s), and (poly)peptides, obtainable according to the methods described above. 

In a further preferred embodiment, the invention provides for modular vector systems 
being compatible with the modular nucleic acid sequences encoding the 
(poly)peptides. The modules of the vectors are flanked by restriction sites unique 
within the vector system and essentially unique with respect to the restriction sites 
incorporated into the nucleic acid sequences encoding the (poly)peptides, except 
for example the restriction sites necessary for cloning the nucleic acid sequences 
into the vector. The list of vector modules comprises origins of singte-stranded 
replication, origins of double-stranded replication for high- and low copy number 
plasmids, promotor/operalor, repressor or terminator elements, resistance genes, 
potential recombination sites, gene 111 for display on filamentous phages, signal 
sequences, purification and detection tags, and sequences of additional moieties. 
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The vectors are preferably, but not exclusively, expression vectors or vectors 
suitable for expression and screening of libraries. 



In another embodiment, the invention provides for a kit, comprising one or more of 
the list of nucleic acid sequence{s), recombinant vector(s). (poly)peptide(s), and 
vector(s) according to the methods described above, and suitable host cell(s) for 
producing the (poly)peptide(s). 

In a preferred embodiment, the invention provides for the creation of libraries of 
human antibodies. In a first step, a database of published antibody sequences of 
human origin is established. The database is used to define subgroups of antibody 
sequences which show a high degree of similarity in both the sequence and the 
canonical fold (as determined by analysis of antibody structures). For each of the 
subgroups a consensus sequence is deduced which represents the members of this 
subgroup: the complete collection of consensus sequences represent therefore the 
complete structural repertoire of human antibodies. 

These artificial genes are then constructed by the use of synthetic genetic subunits. 
These genetic subunits correspond to structural sub-elements on the protein level. 
On the DNA level, these genetic subunits are defined by cleavage sites at the start 
and the end of each of the subelements. which are unique in the vector system. All 
genes which are members of the collection of consensus sequences are 
constructed such that they contain a similar pattern of said genetic subunits. 

This collection of DNA molecules can then be used to create libraries of antibodies 
which may be used as sources of specificities against new target antigens. 
Moreover, the affinity of the antibodies can be optimised using pre-built library 
cassettes and a general procedure. The invention provides a method for identifying 
one or more genes encoding one or more antibody fragments which binds to a 
target, comprising the steps of expressing the antibody fragments, and then 
screening them to isolate one or more antibody fragments which bind to a given 
target molecule. If necessary, the modular design of the genes can then be used to 
excise from the genes encoding the antibody fragments one or more genetic sub- 
sequences encoding structural sub-elements, and replacing them by one or more 
second sub-sequences encoding structural sub-elements. The expression and 
screening steps can then be repeated until an antibody having the desired affinity is 
generated. 
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Particularly preferred is a method in which one or more of the yenetic subunits (e.g. 
the CDR's) are replaced by a random collection of sequences (the library) using the 
said cleavage sites. Since these cleavage sites are (i) unique in the vector system 
and (ii) common to all consensus genes, the same (pre-built) library can be inserted 
into all artificial antibody genes. The resulting library is then screened against any 
chosen antigen. Binding antibodies are eluted, collected and used as starting 
material for the next library. Here, one or more of the remaining genetic subunits are 
randomised as described above. 
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Definitions 

Eroteip: 

The term protein comprises monomeric polypeptide chains as well as homo- or 
heteromultimeric complexes of two or more polypeptide chains connected either by 
covalent interactions (such as disulphide bonds) or by non-covalent interactions 
(such as hydrophobic or electrostatic interactions). 

An al ysis of homologous proteins: 

The amino acid sequences of three or more proteins are aligned to each other 
(allowing for introduction of gaps) in a way which maximizes the correspondence 
between identical or similar amino acid residues at ail positions. These aligned 
sequences are termed homologous if the percentage of the sum of identical and/or 
similar residues exceeds a defined threshold. This threshold is commonly regarded 
by those skilled in the art as being exceeded when at least 15% of the amino acids 
in the aligned genes are identical, and at least 30% are similar. Examples for 
families of homologous proteins are: immunoglobulin superfamily, scavenger 
receptor superfamily, fibronectin superfamilies (e.g. type II and 111), complement 
control protein superfamily, cytokine receptor superfamily, cystine knot proteins, 
tyrosine kinases, and numerous other examples well known to one of ordinary skill 
in the art. 

Consensus sequence: 

Using a matrix of at least three aligned amino acid sequences, and allowing for 
gaps in the alignment, it is possible to determine the most frequent amino acid 
residue at each position. The consensus sequence is that sequence which 
comprises the amino acids which are most frequently represented at each position. 
In the event that two or more amino acids are equally represented at a single 
position, the consensus sequence includes both or all of those amino acids. 

Removing unfavorable interactions: 

The consensus sequence is per se in most cases artificial and has to be analyzed in 
order to change amino acid residues which, for example, would prevent the 
resulting molecule to adapt a functional tertiary structure or which would block the 
interaction with other (poly)peptide chains in multimeric complexes. This can be 
done either by (i) building a three-dimensional model of the consensus sequence 
using known related structures as a template, and identifying amino acid residues 
within the model which may interact unfavorably with each other, or (ii) analyzing the 
matrix of aligned amino acid sequences in order to detect combinations of amino 
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acid residues within tiie sequences which frequsnt'.y occur together in one 
sequence and are therefore lil<ely to interact with each other. These probable 
interaction-pairs are then tabulated and the consensus is compared with these 
"interaction maps". Missing or wrong interactions in the consensus are repaired 
accordingly by introducing appropriate changes in amino acids which minimize 
unfavorable interactions. 

|Hpntifioation of structural siih-elements: 

Structural sub-elements are stretches of amino acid residues within a 
protein/(poly)peptide which correspond to a defined structural or functional part of 
the molecule. These can be loops (e.g. CDR loops of an antibody) or any other 
secondary or functional structure within the protein/(poly)peptide (domains, a- 
helices, B-sheets, framework regions of antibodies, etc.). A structural sub-element 
can be identified using known structures of similar or homologous (poly)peptides, or 
by using the above mentioned matrices of aligned amino acid sequences. Here the 
variability at each position is the basis for determining stretches of amino acid 
residues which belong to a structural sub-element (e.g. hypervariable regions of an 
antibody). 

Siih-sequence: 

A sub-sequence is defined as a genetic module which is flanked by unique 
cleavage sites and encodes at least one structural sub-element. It is not necessarily 
identical to a structural sub-element. 



Cifiavaae site: 

A short DNA sequence which is used as a specific target for a reagent which 
cleaves DNA in a sequence-specific manner (e.g. restriction endonucleases). 

Compatible deavaoe sites: 

Cleavage sites are compatible with each other, if they can be efficiently ligated 
without modification and. preferably, also without adding an adapter molecule.. 

Unique cleavage sites: 

A cleavage site is defined as unique if it occurs only once in a vector containing at 
least one of the genes of interest, or if a vector containing at least one of the genes 
of interest could be treated in a way that only one of the cleavage sites could be 
used by the cleaving agent. 



-14- 

SUBSTTTUTE SHEET (RULE 26) 



wo 97/08320 



PCT/EP96/03647 



f. prmspondin9 foolv^De ptiHP sequences: 

Sequences deduced from the same part of one group of homologous proteins are 
called corresponding (poly)peptide sequences. 

f ^nmmon cleavage sites: 

A cleavage site in at least two corresponding sequences, which occurs at the same 
functional position (i.e. which flanks a defined sub-sequence), which can be 
hydrolyzed by the same cleavage tool and which yields identical compatible ends is 
termed a common cleavage site. 

pyriginn aenptin sub-senuences: 

A method which uses the unique cleavage sites and the corresponding cleavage 
reagents to cleave the target DNA at the specified positions in order to isolate, 
remove or replace the genetic sub-sequence flanked by these unique cleavage 
sites. 

Fxnhanaina genetic sub-seouences: 

A method by which an existing sub-sequence is removed using the flanking 
cleavage sites of this sub-sequence, and a new sub-sequence or a collection of 
sub-sequences, which contain ends compatible with the cleavage sites thus 
created, is inserted. 

Fx pression of genes: 

The term expression refers to in vivo or in vitro processes, by which the information 
of a gene is transcribed into mRNA and then translated into a protein/(poiy)peptide. 
Thus, the term expression refers to a process which occurs inside cells, by which the 
information of a gene is transcribed into mRNA and then into a protein. The term 
expression also includes all events of post-translational modification and transport, 
which are necessary for the (poly) peptide to be functional. 

Rnrfisnina nf prntein/fp nlvVpentide libraries: 

Any method which allows isolation of one or more proteins/(poly)peptides having a 
desired property from other proteins/(poly)peptides within a library. 

Amino acid pattsm cha ranteristic for a species: 

A (poly)peptide sequence is assumed to exhibit an amino acid pattern characteristic 
for a species if it is deduced from a collection of homologous proteins from just this 
species. 



-15- 

suBsnruTE sheet (rule 26) 



PCT/EP96/03647 

WO 97/08320 

ImnpiinoQiobuiin siinprff^milv (IgSF): 

The IgSF is a family of proteins comprising domains being charactenzed by the 
immunoglobulin fold. The IgSF comprises for example T-cell receptors and the 
immunoglobulins (antibodies). 

/^ nti'">f>dv framework: 

A framework of an antibody variable domain is defined by Kabal et al. (1 991) as the 
part of the variable domain which serves as a scaffold for the antigen bmding loops 
of this variable domain. 

z^ntihnriv CDB: 

The CDRs (complementarity determining regions) of an antibody consist of the 
antigen binding loops, as defined by Kabat et al. (1991). Each of the two vanable 
domains of an antibody Fv fragment contain three CDRs. 

for Muman Combinatorial Antibody Library. Antibody Library based on 
modular consensus genes according to the invention (see Example 1). 

/j^n^ihnriy fragment: 

Any portion of an antibody which has a particular function, e.g. binding of antigen. 
Usually, antibody fragments are smaller than whole antibodies. Examples are Fv. 
disulphide-linked Fv. single-chain Fv (scFv), or Fab fragments. Additionally, antibody 
fragments are often engineered to include new functions or properties. 

L'rivf r^«' ^'•^"igwork: 

One single framework which can be used to create the full variability of functions, 
specificities or properties which is originally sustained by a large collection of 
different frameworks, is called universal framework. 

R jnriinn nt an fir!i^'"dv tn its taroet: 

The process which leads to a light and specific association between an antibody 
and a corresponding molecule or ligand is called binding. A molecule or ligand or 
any part of a molecukle or ligand which is recognized by an antibody is called the 
target. 

pppiarinq genetic s iih-seouences 

A method by which an existing sub-sequence is removed using the flanking 
cleavage sites of this sub-sequence, and a new sub-sequence or collection of sub- 
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sequences, which contains ends compatible with Vr.o cisavag. s'tes thus created, is 
inserted. 

A^g;fimhlinq nf qfinetic <;fiqiJences: 

r^^^^ synthetic or natural genetic sequences m a 

specific manner in order to get longer genetic sequences which contain at least 
parts of the used synthetic or natural genetic sequences. 

Ao>.iweic nf hnmoloQOi is genes: 

?;;e orresponding amino add sequences of .wo or mo,e genes are aligned to each 
other in a way which maximizes the correspondence between identical or similar 
amino acid residues at all positions. These aligned sequences are lermed 
homologous it the percentage of the sum of Identical and/or similar residues 
exceeds a defined threshold. This threshold is commonly regarded by those skil ed 
in the art as t>eing exceeded when at least 15 per cent ol the amino acids in the 
aligned genes are identical, and at least 30 per cent are similar. 
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Legends to Figures and Tables 

Fig. 1: Flow chart outlining the process of construction of a synthetic human 

antibody library based on consensus sequences. 
Fig. 2: Alignment of consensus sequences designed for each subgroup (amino 
acid residues are shown with their standard one-letter abbreviation). (A) 
kappa sequences, (B) lambda sequences and (C). heavy chain 
sequences. The positions are numbered according to Kabat (1991). In 
order to maximize homology in the alignment, gaps (-) have been 
introduced in the sequence at certain positions. 
Fig. 3: Gene sequences of the synthetic V kappa consensus genes. The 
corresponding amino acid sequences (see Fig. 2) as well as the unique 
cleavage sites are also shown. 
Fig. 4: Gene sequences of the synthetic V lambda consensus genes. The 
corresponding amino acid sequences (see Fig. 2) as well as the unique 
cleavage sites are also shown. 
Fig. 5: Gene sequences of the synthetic V heavy chain consensus genes. The 
corresponding amino acid sequences (see Fig. 2) as well as the unique 
cleavage sites are also shown. 
Fig. 6: Oligonucleotides used for construction of the consensus genes. The 
oligos are named according to the corresponding consensus gene, e.g. 
the gene VkI was constructed using the six oligonucleotides 01K1 to 
01 K6. The oligonucleotides used for synthesizing the genes encoding 
the constant domains Ok (OCLKI to 8) and CH1 (0CH1 to 8) are also 
shown. 

Fig. 7A/B: Sequences of the synthetic genes encoding the constant domains Ck 
(A) and cm (B). The corresponding amino acid sequences as well as 
unique cleavage sites introduced in these genes are also shown. 

Fig. 7C: Functional map and sequence of module M24 comprising the synthetic 
CX gene segment (huCL lambda). 

Fig. 7D: Oligonucleotides used for synthesis of module M24. 

Fig. 8: Sequence and restriction map ol the synthetic gene encoding the 
consensus single-chain fragment VH3-Vk'2. The signal sequence (amino 
acids 1 to 21) was derived from the E. coli phoA gene (Skerra & 
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Pluckthun. 1988). Between the phoA signal sequsnce and the Vh3 
domain, a short sequence stretch encoding 4 amino acid residues (amino 
acid 22 to 25) has been inserted in order to allow detection of the single- 
chain fragment in Western blot or ELISA using the monoclonal antibody 
M1 (Knappik & Pluckthun, 1994). The last 6 basepairs of the sequence 
were introduced for cloning purposes (EcoRI site). 
Fig. 9: Plasmid map of the vector plG10.3 used for phage display of the H3k2 
scFv fragment. The vector is derived from pIGIO and contains the gene for 
the lac operon repressor, lad, the artificial operon encoding the H3k2- 
gene3ss fusion under control of the lac promoter, the Ipp terminator of 
transcription, the single-strand replication origin of the E. coli phage fl 
(F1_0RI), a gene encoding p-Iactamase (bla) and the ColEI derived 
origin of replication. 

Fig. 10: Sequencing results of independent clones from the initial library. 

translated into the corresponding amino acid sequences. (A) Amino acid 
sequence of the VH3 consensus heavy chain CDR3 (position 93 to 102, 
Kabat numbering). (B) Amino acid sequences of. 1 2 clones of the 1 0-mer 
library. (C) Amino acid sequences of 11 clones of the 15-mer library. *: 
single base deletion. 

Fig. 11: Expression test of individual library members. (A) Expression of 9 
independent clones of the 10-mer library. (B) Expression of 9 
Independent clones of the 15-mer library. The lane designated with M 
contains the size marker. Both the gp3-scFv fusion and the scFv monomer 
are indicated. 

Fig. 12: Enrichment of specific phage antibodies during the panning against FITC- 
BSA. The initial as well as the subsequent fluorescein-specific sub- 
libraries were panned against the blocking buffer and the ratio of the 
phage eluted from the FITC-BSA coated well vs. that from the powder milk 
coated well from each panning round is presented as the ^specificity 
factor". 

Fig. 13: Phage ELISA of 24 independent clones after the third round of panning 

tested for binding on FITC-BSA. 
Fig. 14: Competition ELISA of selected FITC-BSA binding clones. The ELISA 

signals (OD,o5„m) of scFv binding without inhibition are taken as 100%. 
Fig. 15: Sequencing results of the heavy chain GDR3s of independent clones 

after 3 rounds of panning against FITC-BSA. translated into the 

corresponding amino acid sequences (position 93 to 102, Kabat 

numbering). 
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Fig. 16: Coomassie-Blue stained SDS-PAGE of the purifrcd anti-fluorescfiin scFv 
fragments: M: molecular weight marker. A: total soluble cell extract after 
induction, B: fraction of the flow-through, C. D and E: purified scFv 
fragments 1HA-3E4, 1HA-3E5 and 1HA-3E10, respectively. 

Fig. 17: Enrichment of specific phage antibodies during the panning against (3- 
estradiol-BSA. testosterone-BSA, BSA, ESL-1, interleukin-2, 
lymphotoxin-B, and LeY-BSA after three rounds of panning. 

Fig. 18: ELISA of selected ESL-1 and B-estradio! binding clones 

Fig. 19: Selectivity and cross-reactivity of HuCAL antibodies: in the diagonal 
specific binding of HuCAL antibodies can be seen, off-diagonal signals 
show non-specific cross-reactivity. 

Fig. 20: Sequencing results of the heavy chain CDRSs of independent clones 
after 3 rounds of panning against B-estradioI-BSA. translated into the 
corresponding amino acid sequences (position 93 to 102, Kabal . 
numbering). One clone is derived from the lOmer library. 

Fig. 21: Sequencing results of the heavy chain CDR3S of independent clones 
after 3 rounds of panning against testosterone-BSA, translated into the 
corresponding amino acid sequences (position 93 to 102. Kabat 
numbering). 

Fig. 22: Sequencing results of the heavy chain CDR3s of independent clones 
after 3 rounds of panning against lymphotoxin-B, translated into the 
corresponding amino acid sequences (position 93 to 102, Kabat 
numbering). One clone comprises a 14mer CDR, presumably introduced 
by incomplete coupling of the trinucleotide mixture during oligonucleotide 
synthesis. 

Fig. 23: Sequencing results of the heavy chain CDR3s of independent clones 
after 3 rounds of panning against ESL-1, translated into the 
corresponding amino acid sequences (position 93 to 102. Kabat 
numbering). Two clones are derived from the lOmer library. One clone 
comprises a 16mer CDR, presumably introduced by chain elongation 
during oligonucleotide synthesis using trinucleotides. 

Fig. 24: Sequencing results of the heavy chain CDR3s of independent clones 
after 3 rounds of panning against BSA. translated, into the corresponding 
amino acid sequences (position 93 to 102, Kabat numbering). 

Fig. 25: Schematic representation of the modular pCAL vector system. 

Fig. 25a: List of restriction sites already used in or suitable for the modular HuCAL 
genes and pCAL vector system. 

Fig. 26: List of the modular vector elements for the pCAL vector series: shown are 
only those restriction sites which are part of the modular system. 
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Fig. 27 
Fig. 28 
Fig. 29 
Fig. 30 
Fig. 31 
Fig. 32 
Fig. 33 



Functional map and sequence of the multi-cloning site module ',MCS) 



Functional map and sequence of the pMCS cloning vector series. 
Functional map and sequence of the pCAL module Ml (see Fig. 26). 
Functional map and sequence of the pCAL module M7-III (see Fig. 26). 
Functional map and sequence of the pCAL module M9-II (see Fig. 26). 
Functional map and sequence of the pCAL module M11-II (see Fig. 26). 
Functional map and sequence of the pCAL module M14-Ext2 (see Fig. 
26). 

Fig. 34: Functional map and sequence of the pCAL module M17 (see Fig. 26). 

Fig. 35: Functional map and sequence of the modular vector pCAL4. 

Fig. 35a: Functional maps and sequences of additional pCAL modules (M2, M3, 
M7I. M7II. M8. M10II, M11II, M12, M13. m9, M20. M21, M41) and of low- 
copy number plasmid vectors (pCALOl to pCAL03). 

Fig. 35b: List of oligonucleotides and primers used for synthesis of pCAL vector 
modules. 

Fig. 36: Functional map and sequence of the B-lactamase cassette for 
replacement of CDRs for CDR library cloning. 

Fig: 37: Oligo and primer design for Vk CDRS libraries 

Fig. 38 : Oligo and primer design for VX. CDR3 libraries 

Fig. 39: Functional map of the pBS13 expression vector series. 

Fig. 40: Expression of all 49 HuCAL scFvs obtained by combining each of the 7 
VH genes with each of the 7 VL genes (pBS13, SCC): Values are given 
for the percentage of soluble vs. Insoluble material, the total and the 
soluble amount compared to the combination H3k2. which was set to 
100%. In addition, the corresponding values for the McPC603 scFv are 
given. 

Table 1: Summary of human immunoglobulin germline sequences used for 
computing the germline membership of rearranged sequences. (A) kappa 
sequences. (B) lambda sequences and (C), heavy chain sequences. (1) 
The germline name used in the various calculations, (2) the references 
number for the corresponding sequence (see appendix for sequence 
related citations), (3) the family where each sequence belongs to and (4), 
the various names found in literature lor germline genes with identical 
amino acid sequences. 

Table 2: Rearranged human sequences used for the calculation of consensus 
sequences. (A) kappa sequences, (B) lambda sequences and (C), heavy 
chain sequences. The table summarized the name of the sequence (1), 
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Table IB: Human lambda germline gene segments 
Used Name' Reference' Family' Germline genes* 



DPL1 1 1 

DPL2 1 1 HUMLV1L1 

DPL3 1 1 HUMLV122 

DPL4 1 1 VLAMBDAl.l 

HUMLVn? 2 1 

DPL5 1 1 HUMLVn7D 

DPL6 1 1 

DPL7 1 1 IGLV1S2 

DPL8 1 1 HUMLV1042 

DPL9 1 1 HUMLV101 

DPLIO 1 2 

VLAMBDA2.1 3 2 

DPLll 1 2 

DPL12 1 2 

DPL13 1 2 

DPL14 1 2 

DPL16 1 3 Humlv418;IGLV3Sl 

DPL23 1 3 VI 111.1 

Humlv318 4 3 

DPL18 1 7 4A;HUMIGLVA 

DPL19 1 7 

DPL21 1 8 VL8.1 

HUMLV801 5 8 

DPL22 1 9 

DPL24 1 unassigned VLAMBOA N.2 

gVLX-4.4 e 10 
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Table IC: Human heavy chain germline gene segments 



Used Name^ Reference' Fam 



ly' Germline genes' 



VHl-12-1 


19 




DP 10: DA-2;DA-6 


VH1-12-8 


22 


1 


RR.VH1:2 


VHl-12-2 


6 




hv1263 


VH 1-1 2-9 


7 




YAC-7;RR.VH1.1; 1-69 


VH 1-1 2-3 


19 




DP3 


VH 1-1 2-4 


19 


1 


DP21:4d275a;VH7a 


\/Hl-l2-5 


18 


1 


l-4.1b:V1-4.1b 


VH1-12-6 


21 




lD37;VH7b ;7-81;YAC-10 


VH1-12-7 


19 




DP14:VH1GRR:V1-18 


VH1-13-1 


10 




71-5: DP2 


VH 1-13-2 


10 




E3-10 


VH1-13-3 


19 


1 


DPI 


\/Hl-l3-4 


12 


1 


V35 


VH1-13-5 


8 




VI -2b 


VH1-13-6 


18 


1 


l-2;DP75 


VH1-13-7 


21 


1 


VI -2 


VH 1-1 3-8 


19 


1 


DPS 


VHl-13-9 


3 




1-1 


VH1-13-10 


19 




DP12 


VH1-13-1 1 


15 




V13C 


VH1-13-12 


18 


1 


l-3b: DP25:V1-3b 


VH1-13-13 


3 




1-92 


VH1-13-14 


18 


1 


l-3;Vl-3 


VH1-13-15 


19 


1 


DPl5;Vl-8 


VH1-13-16 


3 


1 


21-2:3-1: DP7;V1-46 


VH1-13-17 


16 


1 


HG3 


VHl-13-18 


19 




DP4;7-2:Vl-45 


VH1-13-19 


27 




COS 5 


VH1-1X-1 


19 




DP5: 1-24P 


VH2-21-1 


18 


2 


ll-5b 


VH2-31-1 


2 


2 


VH2S12-1 


VH2-31-2 


2 


2 


VH2S12-7 


VH2-31-3 


2 


2 


VH2512-9; DP27 


VH2-31-4 


2 


2 


VH2S12-10 


VH2-31-5 


14 


2 


V2-26; DP2G; 2-26 


VH2-31-6 


15 


2 


VF2-26 
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Table IC: (continued) 



Used Name' 


Reference' 


Family' 


Germline genes' 


VH2-31-7 


19 


2 


DP28:DA-7 


VH2-31-14 


7 


2 


YAC-3; 2-70 


VH2-31-8 


2 


2 


VH2S12-5 


VH2-31-9 


2 


2 


VH2S12-12 


\/H2-31-10 


18 


2 


11-5; V2-5 


\/H9-11-1 1 


2 


2 


VH2Sl2-2;VH2Sl2-8 


\/H9-ll-12 


2 


2 


VH2Sl2-4:VH2Sl2-6 


VH2-31-13 


2 


2 


VH2S12-14 


\/H3-1 1-1 


13 


3 


v65-2; DP44 


VH3-n-2 


19 


3 


DP45 


VH3-1 1-3 


3 


3 


13-2; DP48 


VH3-11-4 


19 


3 


DP52 


VH3-11-5 


14 


3 


v3-l3 


VH3-11-G 


19 


3 


DP42 


VH3-11-7 


3 


3 


8-1 B;YAC-5: 3-66 


VH3-11-8 


14 


3 


V3-53 


VH3-13-1 


3 


3 


22-2B;DP35;V3-11 


\/H3-13-5 

V 1 1 0 1 «^ *J 


19 


3 


DP59:VH19;V3-35 


VH3-13-6 


25 


3 


fl-pl; DP61 


VH3-13-7 


19 


3 


DP46: GL-SJ2; COS 8; hv3005; hv3005f3; 3d21b; 56p1 


VH3-13-8 


24 


3 


VH26 


VH3-13-9 


5 


3 


vh26c 


VH3-13-10 


19 


3 


DP47;VH26; 3-23 


VH3-13-n 


3 


3 


1-91 


VH3-13-12 


19 


3 


DP58 


VH3-13-13 


3 


3 


1-9III; DP49;3-30; 3d28.1 


VH3-13-14 


24 


3 


301989; DP50; 3-33; 3d277 


VH3-13-15 


27 


3 


COS 3 


VH3-13-16 


19 


3 


DP51 


VH3-13-17 


16 


3, 


H11 


VH3-13-18 


19 


3 


DPS3; COS G; 3-74; DA-8 


VH3-13-19 


19 


3 


DP54;VH3-ll;V3-7 


VH3-13-20 


14 


3 


V3-64; YAC-6 


VH3-13-21 


14 


3 


V3-48 


VH3- 13-22 


14 


3 


V3-43; DP33 


VH3-13-23 


14 


3 


V3-33 
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Table IC: (continued) 



Used Name' 


Reference' 


Family' 


Germline genes* 


VH3- 13-24 


14 


3 


V3-21; DP77 


VH3-13-25 


14 


3 


V3-20; DP32 


VH3-13-26 


14 


3 


V3-9;DP31 


VH3-14-1 


3 


3 


12-2: DP29; 3-72; DA-3 


VH3-14-4 


7 


3 


YAC-9; 3-73; MTGL 


VH3-14-2 


4 


3 


VHD26 


VH3-14-3 


19 


3 


DP30 


\/H3-lX-l 


1 


3 


LS68.1: LSG9.1: LSGlO.l: HUM12IGVH; HUM131GVH 


VH3-1X-2 


1 


3 


LSGn.l;HUM4IGVH 


VH3-1X-3 


3 


3 


9-1; DP38; LSG7.1; RCGl.l; LSGl.1; LSG3.1; LSG5.1: 








nuMlSlbVH; nUMilbVn, nuiviyiuvn 


VH3-1X-4 


1 


3 


LSG4,1 


VH3-1X-5 


1 


3 


L5G2.1 


VH3-1X-6 


1 


3 


LSG6.1; HUMlOiGVH 


VH3-1X-7 


18 


3 


3-15; V3-15 


VH3-1X-8 


1 


3 


LSG12.1; HUM5IGVH 


VH3-1X-9 


14 


3 


V3-49 


VH4-11-1 


22 


4 


lOU-Vn4.zl 


VH4-11-2 


17 


4 


vyu>i oi . nDCT- \/WC^* AriTfi- MA^IA 
Vn4.zl, UrbJ, Vno, h-u/d, vh ot 


VH4-11-3 


23 


4 


4.44 


VH4-11-4 


23 


4 


4.44.3 


VH4-11-5 


23 


4 


4.36 


VH4-11-6 


23 


4 


4.0/ 


\/H4-n-7 


18 


A 

4 


IV-4, 4.Jo, V4-4 


\/H4-1 1-8 


1 7 


A 

4 


VHd 11 ' "^d 1 Q7d- DP7 1 * 58d2 


VH4-11-9 


20 


4 


lit 
n/ 


VH4-n-10 


20 


4 


no 


\/H4-l 1-1 1 




A 


H9 


VH4-11-12 


17 


4 


VH4.16 


VH4-11-13 


23 


4 


4.38 


VH4-11-14 


17 


4 


VH4.15 


VH4-n-15 


11 


4 


58 


VH4-11-16 


10 


4 


71-4; V4-59 


VH4-21-1 


11 


4 


11 


VH4-21-2 


17 


4 


VH4.17; VH4.23; 4d255; 4.40; DPG9 


VH4-21-3 


17 


4 


VH4,19; 79;V4-4b 



5/ 



SUBSTITUTE SHEET (RULE 26) 
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Used Name' Reference' Family' Germline genes* 



VH4-21-4 


19 


4 


DP70; 4dG8; 4.41 


VH4-21-5 


19 


4 


DP67;VH4-4B 


VH4-21-6 


17 


4 


VH4.22;\/HSP;VH-JA 


VH4-21-7 


17 


4 


VH4.13; 1-911; 12G-1; 3d28d: 4.42; DP68; 4-28 


\/H4-21-8 


26 


4 


hv4005; 3d24d 


vn*t~c. 1 ^ 


17 


4 


VH4.14 


Vn*r"0 I 1 


23 


4 


4.34: 3d230d; DP78 


\/UA 


23 


4 


4.34.2 


\/HA-l 1 
VnH- J 1 o 


19 


4 


DP64;3d216d 




19 


4 


DPG5:4-31;3d277d 


vn*T o 1 *i 


23 


4 


4.33; 3d75d 


vn*T" J 1 u 


20 


4 


HIO 


vn*T~*?i / 


20 


4 


Hll 


VH4-31-8 


23 


4 


4.31 


\/H4-31-9 


23 


4 


4.32 


\/H4-31-10 

V 1 l~ w • t w 


20 


4 


3d277d 


VH4-31-11 


zu 


4 


3d216d 


VH4-31-12 


20 


A 

4 




VH4-31-li 


1 7 
1 / 


A 

•T 


VH4.18; 4dl54; DP79 


VH4-3 1-14 


p 
o 


4. 


V4-39 


VH4-31-15 


1 1 


*r 


2-1;DP79 


VH4-31-16 


23 


4 


4.30 


VH4-31-17 


17 


4 


VH4.12 


VH4-31-18 


10 


4 


71-2; DP66 


VH4-31-19 


23 


4 


4.39 


VH4-31-20 


8 


4 


V4-G1 


VH5-12-1 


9 


5 


\/H251 ; DP73; VHVCW; 51 -R1 : VHVLB; VHVCH; VHVTT; 






VHVAU; VHVBLK; VhAU; V5-51 


VH5-12-2 


17 


5 


VHVJB 


VH5-12-3 


3 


5 


1-v;DP80; 5-78 


VH5-12-4 


9 


5 


VH32; VHVRG; VHVMW; 5-2R1 


VH6-35-1 


4 


6 


VHVI- VH6; VHVIIS; VHVITE; VHVIJB; VHVICH; VHVICW; 






VHVIBLK;VHVIMW: DP74; 6-lGl;V6-1 
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Table 2A: rearranged human kappa sequences 





aa^ Computed 
family^ 


Germline 
gene* 


Diff. to 
germline* 


% diff. to 
germline^ 


Reference' 


I1I-3R 


108 1 


08 


1 


1 1 1 /u 


70 


No.86 


109 1 


08 


3 




80 


AU 


108 1 


08 


6 




103 


ROY 


108 1 


no 


D 




43 


IC4 


108 1 


08 


C 




70 


HIV-B26 


10G 1 


08 


3 




8 


6RI 


108 1 


08 


8 




30 


AG 


106 1 


08 


8 




1 Ifi 


REI 


108 1 


08 


9 


9.5% 


00 


CLL PATIENT 16 


88 1 


08 


2 


*i on//* 
2,3Wo 


199 


CLL PATIENT 14 


87 1 


08 


2 




199 


CLL PATIENT 15 


88 1 


08 


2 




122 


GM4672 


108 1 


08 


1 1 


1 1 fiO/n 


24 


HUM. YFC51.1 


108 1 


08 


12 




1 10 


LAY 


108 1 


08 


1 0 


19 fitVo 


48 


HIV-b13 


106 1 


08 


Q 

y 


Q 70/n 


8 


MAL-NaCI 


108 1 


08 


13 




102 


STRAb SA-lA 


108 1 


02 


0 


U.UVfO 


190 


HuVHCAMP 


108 1 


08 


13 


lo,/"/o 


inn 


CRO 


108 1 


02 


10 






Am 107 


108 1 


02 


12 


1 2,6% 




WALKER 


107 1 


02 


4 




R7 


II1-2R 


109 1 


A20 


0 


U.UVrO 


70 


F0G1-A4 


107 1 


A20 


4 




41 


HK137 


95 1 


Ll 


0 




10 


CEA4-8A 


107 1 


02 


/ 


7 dO/n 


41 


Va" 


95 1 


L4 


ft 

u 


n no/n 


90 


TR1.21 


108 1 


02 


4 


4. 90/n 


92 


HAU 


108 1 


02 


6 


6.30/0 


123 


HK102 


95 1 


L12(l) 


0 


0.0% 


9 


H20C3K 


108 1 


L12(2) 


3 


3.2% 


125 


CHEB 


108 1 


02 


7 


7.4% 


5 


HK134 


95 1 


L15(2) 


0 


0.0% 


10 


TEL9 


108 1 


02 


9 


9.5% 


73 



53 
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1 vol 11 V 


aa' Computed 
family^ 


Germline 
gene* 


Diff. to 
germline* 


% diff. to 
germline* 


Referenc 


TR1.32 


103 1 


no 




3.2% 


92 
121 


RF-KES1 


97 1 




A 


4,2% 


WES 


108 - 1 


L5 


in 




61 


OlLpI 


95 1 




1 
1 


1,1% 


70 


SA-4B 


107 1 


LI 2(2) 


o 
o 


Q AO/ft 


120 


HKlOl 


95 1 


L15(l) 


U 


n no/n 


9 


TR1.23 


108 1 


02 


r 

5 




92 


HF2-1/17 


108 1 


A30 


u 


U,UVfO 


A 


2E7 


108 1 


A30 


1 




0^ 


33.C9 


107 1 


L12(2) 


7 




1 ^0 


3D6 


105 1 


L12(2) 


2 


Z, 1 "/O 


34 


1-23 


108 1 


L8 


o 
o 




- 70 


RF-KL1 


97 1 


L8 


A 

4 


A 90/a 


121 


TNF-E7 


108 1 


A30 


9 




41 


TR1.22 


108 1 


02 


1 






HIV-B35 


106 1 


02 


2 


9 90/n 




HIV-b22 


106 1 


02 


2 


z.zMro 


Q 
0 


HIV-b27 


106 1 


02 


2 


1 on/** 

Z,ZVfO 


0 


HIV-B8 


107 1 


02 


10 


lVJ,O"/0 


Q 
0 


HIV-b8 


107 1 


02 


10 


lU.oMro 


0 


RF-SJ5 


95 1 


A30 


5 






GAL(I) 


108 1 


A30 


b 


R 10/n 


64 


R3.5H5G 


108 1 


Ui 


c 

D 




70 


HlV-bl4 


10G 1 


AzU 


z 


? 50/0 


8 


TNF-El 


105 1 


Lb 


p 


8,4% 


41 


WEA 


108 1 




Q 

o 


8.4% 


37 


EU 


108 1 






5,3% 


40 


FOG 1 -G8 


108 1 


LO 


1 1 
1 1 


1 1,G% 


41 


1X7RG1 


108 1 


LI 


8 


8,40/0 


70 


BLI 


108 1 


L8 


3 


3.20/0 


72 


KUE 


108 1 


LI 2(2) 


11 


II.6O/0 


32 


LUNmOl 


108 1 


LI 2(2) 


10 


10,50/0 


6 


HIV-bl 


106 1 


A20 


4 


4,30/0 


8 


HIV-S4 


103 1 


02 


2 


2,20/0 


8 



6^ 
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Table 2A: (continued) 



Name' 

CAR 
BR. 

CLL PATIENT 10 
CLL PATIENT 12 
KING 
V13 

CLL PATIENT 1 1 
CLL PATIENT 13 
CLL PATIENT 9 
HIV-B2 
HIV-b2 

CLL PATIENT 5 
CLL PATIENT 1 
CLL PATIENT 2 
CLL PATIENT 7 
CLL PATIENT 8 
HIV-b5 

aL PATIENT 3 
CLL PATIENT 4 
CLL PATIENT 18 
CLL PATIENT 17 
HIV-b20 
2C12 

1B11 

IHI 

2A12 

CUR 

6L0 

RF-TSl 

GAR- 
FLO 

PIE 

HAH 14.1 
HAH 14.2 



aa' Computed 
family' 



107 
107 

88 

88 

108 

95 

87 

87 

88 

106 

106 

88 

88 

88 

88 
88 

105 

87 

88 

85 

86 

107 

108 

108 

108 

108 

109 

109 

96 

109 

109 

109 

109 

109 



3 
3 
3 
3 
3 
3 
3 
3 



Germlinc 
gene* 

LI 2(2) 
LI 2(2) 

02 

02 
LI 2(2) 

L24 

02 

02 

012 

A20 

A20 

A20 
L8 
LB 
L5 
L5 
L5 
L8 
L9 
L9 

LI 2(2) 
A27 

LI 2(2) 

LI 2(2) 

LI 2(2) 

LI 2(2) 
A27 
A27 
A27 
A27 
A27 
A27 
A27 
A27 

55 



Diff. to 
germline* 

TT 
11 
0 
0 
12 
0 
0 
0 
1 
9 
9 
1 
2 
0 
0 
0 
11 
1 
0 
6 
7 
11 
20 
20 
21 
21 
0 
0 
0 
0 
0 
0 
1 
1 



% diff. to Reference' 
germ line* 



11.7% 
11,6% 
0.0% 
0.0^0 
12.6% 

o.o<yo 

O.OWo 
O.OO/o 
1.1% 
9.7% 
9.7% 
1.1% 
2.3% 
0.0% 
0,0% 
0.0% 
12,0% 
1,1% 
0,0% 
7.1% 
8,1% 

n.7% 

. 21.1% 
21.1% 
22,1% 
22.1% 
0.0% 
0.0% 
0.0% 
O.OWo 
O.OWo 
0.0% 
1.0% 
1,0% 



79 
50 
122 
122 
30 
46 
122 
122 
122 

8 

8 

122 
122 
122 
122 
122 
8 

122 
122 
122 
122 
8 

68 

68 

68 

68 

66 

16 

121 

67 

66 

91 

51 

51 
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Table 2A: 


(continued) 












Name' 


aa* 


Computed 


Germline 


Diff. to 


VtO Olll. lu 


Rpfprenc 

r»t 1 v.* 




family' 


gene 


germrme' 


germline'^ 




HAH 16.1 


109 


3 


A27 


1 


l.OOA) 


51 


NOV 


109 


3 


A27 


1 


1,00/0 


52 


'^l F12 


108 


3 


A27 


1 


1,00/0 


126 


8E10 


110 


3 


AZ7 


1 


1.0% 


25 


TH3 


109 


3 


pai 


1 


1,00/0 


25 


HIC (Rl 


108 


3 


A27 




0.00/0 


51 


SON 


110 


3 


A27 


1 


l.OO/o 


67 


PAY 


109 


3 


A27 


1 


1,00/0 


66 


HOT 


109 


3 


A27 


1 


1,00/0 


67 


mAhA6H4C5 


109 


3 


A27 


1 


1.00* 


12 


BOR' 


109 


3 


A27 


2 


2,10/0 


84 


RF-SJ3 

111 mfJ'J 


96 


3 


A27 


2 


-.,2.10/0 


121 


SIE 


109 


3 


A27 


2 


2.10/0 


15 


ESC 


109 


3 


A27 


2 


2.10/0 


98 


HEW 


110 


3 


A27 


2 


2.1% 


98 


YES8C 


109 


3 


A27 


3 


3.1% 


33 


Tl 


109 


3 


A27 


3 


3,10/0 


114 


mAbin 


109 


3 


pai 


3 


3,10/0 


71 


HEW 


107 


3 


A27 


0 


O.OO/o 


94 


BRO 


106 


3 


A27 


0 


o.m 


94 


ROB 


106 


3 


A27 


0 


O.OO/o 


94 


NG9 


96 


3 


A27 


4 


4.20/0 


11 


NEU 


109 


3 


A27 


4 


4.2% 


66 


WOL 


109 


3 


A27 


4 


4.2% 


2 


35G6 


109 


3 


A27 


4 


4.2% 


59 


RF-SJ4 


109 


3 


All 


0 


0.0% 


88 


KAS 


109 


3 


A27 


4 


4.2% 


84 


BRA 


106 


3 


A27 


1 


1,1% 


94 


HAH 


106 


3 


A27 


1 


1,1% 


94 


HIC 


105 


3 


A27 


0 


0,00/0 


94 


FS-2 


109 


3 


A27 


6 


6,3% 


87 


JH- 


107 


3 


A27 


6 


6,30/0 


38 


EVl-15 


109 


3 


A27 


6 


6,30/0 


83 


SCA 


108 


3 


A27 


6 


6,3% 


65 



SUBSTITUTE SHEET (RULE 26) 



wo 97/08320 
Table 2A: (continued) 



PCT/EP9d/03647 



Name^ 



aa 



^ Computed Germline Diff. to %diff. to 
3 4 germline^ germline 



Reference' 



family^ 



gene^ 



109 


3 


A27 


6 


6.3% 


71 


103 


3 


A27 


3 


3.3% 


94 


109 


3 


A27 


6 


6.3% 


120 


108 


3 


A27 


6 


6.3% 


98 


109 


3 


A27 


7 


7.3% 


82 


105 


3 


A27 


9 


9.7% 


125 


110 


3 


A27 


-9 


9.4% 


125 


105 


3 


A27 


5 


5.4% 


52 


89 


3 


A27 


1 


1,1% 


122 


107 


3 


A27 


7 


7.4% 


8 


107 


3 


A27 


7 


7.4% 


8 


99 


3 


A27 


1 


1.1% 


52 


108 


3 


L6 


0 


O.Wo 


85 


in? 


3 


L6 


0 


O.OO/o 


126 


1 Wl7 


3 


A27 


12 


12.5% 


7 




3 


A27 


1 


1,2% 


52 


96 


3 


LI 0(2) 


0 


0.0% 


89 


108 


3 


L6 


1 


1,1% 


109 




3 


L6 


1 


l.io/o 


109 


107 


3 


L6 


2 


2,1% 


99 


108 


3 


L6 


1. 


1,1% 


109 


108 


3 


L6 


1 


1,1% 


109 


107 


3 


L6 


2 


2,1% 


99 


107 


3 


L6 


2 


2.1% 


. 99 


108 


3 


L6 


1 


1,1% 


109 


108 


3 


L6 


1 


1.1% 


109 


107 


J 




2 


2.1% 


99 


107 


3 


L6 


2 


2.1% 


99 


108 


3 


L6 


1 


1.1% 


109 


107 


3 


L6 


3 


3.2% 


99 


109 


. 3 


A27 


13 


13,5% 


6 


108 


3 


A27 


13 


13.7% 


55 


99 


3 


A27 


3 


3.5% 


13 


108 


3 


L6 


4 


4.20/0 


111 
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mAbli2 

SIC. 

SA-4A 

SER 

GOL* 

B5G10K 

HG2B10K 

Taykv322 

CLL PATIENT 24 

HlV-b24 

HIV-b6 

TaYkv3lO 

KA3D1 

19.E7 

rsv6L 

TaYkv320 

Vh 

LS8 

LSI 

LS2S3-3 

LS2 

LS7 

LS2S3-4d 
LS2S3-4a 
LS4 
LS6 

LS2S3-10a 

LS2S3-8C 

LS5 

LS2S3-5 

LUNm03 

IARC/BL41 

Slkv22 

POP 
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Name' 

LS2S3-10b 

LS2S3-8f 

LS2S3-12 

HIV-B30 

HiV-B20 

HIV-b3 

HIV-S6 

YSE 

POM 

Humkv328 

CLL 

LES 

HlV-sS 

HIV-S7 

slkvl 

Humka31es 
slkvl 2 



aa' Computed 
family^ 



107 

107 

107 

107 

107 

108 

104 

107 

109 

95 

109 

96 

104 

104 

99 

95 

101 



RF-TS2 


95 


11-1 


109 


HiV-s3 


105 


RF-TMCI 


96 


GER 


109 


GF4/1.1 


109 


mAb114 


109 


HIV-loop13 


109 


bkvie 


86 


CLL PATIENT 29 


86 


slkv9 


98 


bkv17 


99 


slkvl 4 


99 


slkvl 6 


101 


bkv33 


101 


slkvl 5 


99 


bkvG 


100 



3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 



Germline 
gene* 

U 
L6 
L6 
A27 
A27 
A27 
A27 
L2/L16 
L2/L16 
L2/L16 
L2/L16 
L2/L16 
A27 
A27. 
A27 
L2/L16 
A27 
L2/L16 
L2/L16 
A27 
L6 
L2/L16 
L2/L16 
L2/L16 
L2/L1G 
L6 
L6 
L6 
L6 
L6 
L6 
L6 
L6 
L6 



Diff. to % diff. to 
germline* germline* 



3 

3 

3 
11 
11 
11 

9 

1 

9 

1 

3 

3 

11 

11 

7 

4 

8 

3 

4 

13 

10 

7 

8 

6 

7 

1 

1 

3 

1 

1 

2 

4 

2 

3 



3,2% 
3,2% 
3,2% 
11,7% 
11,7% 
11.7% 
9.9% 
1,1% 
9.4% 
1.1% 
3.2% 
3.2% 
12,1% 
12.1% 
8.1% 
4.2% 
9.2% 
3.2% 
4.2% 
14.3% 
10.5% 
7.4% 
8.4% 
6,3% 
7.4% 
1.2% 
1.2% 
3,5% 
1.2% 
1.2% 
2,3<yo 
4.7% 
2.3% 
3,5% 



Reference' 

"99" 
99 
99 
8 
8 
8 
8 

72 
53 
19 
47 
38 
8 
8 
13 
18 
13 
121 
70 
8 

121 
75 
36 
71 
8 

13 
122 
13 
13 
13 
13 
13 
13 
13 
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Table 2A: (continued) 



PCT/EP96/03647 



Name' aa' Computed Germline Diff.to o/odiff.to Reference' 







family^ 


gene* 


germlme 


germlme* 




RbBBK 


1 no 




1 '5/1 1C 


17 


12,60/o 


125 


AL 70U 


in? 

lU/ 


J 




Q 


9.5% 


117 


slkvl 1 


1 nn 

lUU 




LZ/L 1 0 


0 


3,5% 


13 


slkv4 


y/ 


J 


LD 


4 


4,8% 


13 


CLL PATIcNI 2b 


of 


•) 
J 


1 0/1 IK 
LZ/L ID 


1 
1 


1 IP/o 


122 


ALSe124 


1 AO 

103 


J 


1 0/1 1C 
LZ/LiD 


Q 




117 


slkvl3 


100 


3 


Lz/Llb 


c 
D 




13 


bkv7 


•4 r\r\ 

TOO 


3 


LZ/LlD 


c 
D 




13 


bkv22 


100 


3 


Lz/Llb 


C 
D 




13 


CLL PATIENT 27 


84 


3 


L2/Llb 


U 




122 


bkv35 


100 


3 


Lb 


o 
o 


Q OO/n 


11 


CLL PATIENT 25 


87 


3 


toil 4 ^ 

L2/L16 


4 


*f,u"/0 


172 


Slkv3 


86 


3 


L2/L16 


7 


P 10/n 
O, 1 "/O 


1*^ 


slkv7 


99 


1 


U2 


/ 


O, 1 


13 


HuFd79 


111 


3 


L2/L16 


z4 


OA. OO/rt 
Z*f,ZXfO 


71 


RAD 


99 


3 


A27 


Q 

y 


1 n OOA 


7R 


CLL PATIENT 28 


83 


3 


L2/Llb 


4 




122 


REE 


104 


3 


Lz/Llb 


ZD 


77 70/n 


95 


FR4 


99 


3 


AZ/ 


o 


9 90/q 


77 


MD3.3 


92 


3 


Lb 


1 
1 




54 


MD3.1 


92 




Lb 


n 


0,0% 


54 


GA3.6 


92 


3 


1 c 
Lb 


Z 


7 GO/n 


54 


M3,5N 


92 


3 


1 c 
Lb 


•3 
J 




54 


WEI' 


82 


•> 
3 


A2/ 


U 


nno/n 


65 


MD3.4 


92 


3 


Lz/Llb 


1 


1 '^Q/n 

1 ,0"U 


54 


MD3.2 


91 


3 


Lb 


o 
o 


T ftP/n 


54 


VER 


97 


3 


Am 

A27 


1 Q 

1 y 


ZZ,*T*VO 


70 


CLL PATIENT 30 


78 


3 


L6 


3 


3,80/0 


122 


M3.1N 


92 


3 


L2/L16 


1 


1.3% 


54 


MD3.6 


91 


3 


L2/n6 


0 


0.0% 


54 


MD3.8 


91 


3 


L2/L16 


0 


0.0% 


54 


GA3.4 


92 


3 


LG 


7 


9.00/0 


54 


M3.GN 


92 


3 


A27 


0 


O.OO/o 


54 


MD3.10 


92 


3 


A27 


0 


O.OO/o 


54 



SUBSTITUTE SHEET 



(RULE 26) 



wo 97/08320 . 
Table 2A: (continued) 



PCT/EP96/03647 



Name' 




Computed 
family^ 


Germline 
gene* 


Diff. to 

germlme 


% diff. to 
germiine 


Reference 


MD3.13 




«j 


A27 


0 


00% 


54 
54 


M03.7 


yj 




A27 


0 


0.0% 


MD3.9 




n 
J 


A97 


0 


0.0% 


54 


GA3.1 


93 


o 
o 


A? 7 


6 


7.6% 


54 


bkv32 


101 


o 
J 


A97 


5 


5.7% 


13 


GA3.5 


93 


3 


A97 
/ 


c; 
%j 


6,3% 


54 


6A3.7 


92 


3 


Pilf 


7 


8,9% 


54 


MD3.12 


92 


3 


AO? 


9 


2,5% 


54 


M3.2N 


90 


*> 
3 




a 


7,8% 


54 


MD3.5 


92 


-> 
3 




1 
1 


1,3% 


54 


M3.4N 


91 


3 


1 9/t 1 R 
LZ/L 1 D 


0 


10,3% 


54 


M3.8N 


91 


3 


1 9/M R 
LZ/L 1 0 


7 


9.0<Vo 


54 


M3.7N 


92 


J 


A97 


3 


3.8% 


54 


6A3.2 


92 


3 


A97 
r\Z / 




11,4% 


54 


GA3.8 


93 


3 


A\Z/ 


"T 


5.1% 


54 


GA3.3 


92 


3 


A07 
AZ/ 




10,1% 


54 


M3.3N 


92 


3 


AOT 
AZ/ 


c 


6,3% 


54 


B6 


83 


3 


A*)? 
AZ / 


0 

0 


1 1 ,3% 


78 


E29.1 KAPPA 


78 


3 


19/1 Ifi 
LZ/L 1 0 


0 


0.0% 


22 


sew 


i AO 
108 




OR 


12 


12.6% 


31 


REI-based CAMPATH-9 


lU/ 




wo 


14 


14.7% 


39 


RZ 


107 






14 


14,7% 


50 


Bl 


lUO 




08 


14 


14,70/b 


14 


AND 


10/ 




09 


13 


13,7% 


69 


2A4 


109 




no 


12 


12.6% 


23 


KA 


1 no 
lOo 




OR 


19 


20,0% 


107 


MEV 


109 




09 
UZ 


1 A 


14,7% 


29 


DEE 


lUb 




02 


13 


14.0% 


76 


OU(IOC) 


108 




02 


18 


18.9% 


60 


HuRSVl9VK 


in 




08 


21 


21.0% 


115 


SP2 


108 




02 


17 


1 7,9% 


93 


BJ26 


99 




08 


21 


24,1% 


1 


Nl 


112 




08 


24 


24,2% 


106 


BMA0310EUCIV2 


106 




L12(1) 


21 


22,3% 


105 



<56 



suBsnruTH sheet (rule 



26) 



wo 97/08320 

Table 2A: (continued) 



PCT/EP96/03647 





aa' Computed 
family^ 


Germline 
gene* 


Diff. to 
germline^ 


% diff. to 
germline* 


Reference 


CLL PATIENT 6 


71 


1 


Ann 

A20 


A 

u 


0 0<Vd 


122 
1 


BJ19 




1 


no 


1 D 


21,90/0 


6M 607 


1 1 o 


2 


A3 


n 
u 


000/0 


58 


R5A3K 


114 


2 


AJ 


1 

1 


1,0% 


125 


R1C8K 


1 1 4 


2 


A3 


1 
1 


1 OO/n 


125 


VK2.R149 


1 1 T 


2 


A3 


z 


9 no/n 


118 


TR1.6 


109 


2 


A3 


4 




92 


TR1.37 


104 


2 


A3 


5 


c no/ft 


3Z 


FS-1 




9 


A3 


6 


D.UWo 


R7 
0/ 


TR1.8 


1 in 




A *!) 

A3 


a 
o 


6 00/0 


92 


NIM 


in 

1 1 o 


2 


A3 


Q 

o 


ft fW/h 


28 


Inc 


112 


2 


A3 


1 1 


1 1 no/n 


35 


TEW 


107 


2 


A3 


n 
D 




96 


CUM 


114 


2 


Ul 


7 


£ 90/0 


44 


HRFl 


71 


2 




4 


5.6% 


124 


CLL PATIENT 19 


87 


2 


A*} 
Ao 




0,0% 


122 


CLL PATIENT 20 


87 


2 


A3 


U 


n no/n 


122 


MIL 


112 


2 


A3 


1 b 


1 K 90/n 
I D,Z*vO 


26 


FR 


113 


2 


A3 


zU 


9n no/n 


101 


MAL-Urine 


83 


1 


02 


c 

D 


8 60/0 


102 


T3ykv306 


73 


3 


AO? 


1 
t 


1,6% 


52 
52 


Taykv312 


75 


3 


AT? 


1 

\ 


1.6% 


HlV-b29 


93 


3 




1 4 


17,5% 


8 


1-185-37 


110 


3 


AO? 


0 


0,00/0 


119 


1-187-29 


110 


3 


A07 
Az/ 


n 


0.0% 


119 


TT117 


1 1 n 




A07 
Az / 


Q 

Z7 


9,40/a 


63 


HIV-loop8 


108 


3 


A07 
AZ / 


Ifi 


1 6.8% 


8 


rsv23L 


108 


3 


A27 


16 


16.80/0 


7 


HIV-b7 


107 


3 


A27 


14 


1 A QO/a 

1 4,yvfo 


0 
0 


HlV-bl 1 


107 


3 


A27 


15 


16.0% 


8 


HIV-LC1 


107 


3 


Ml 


19 


20,2% 


8 


HIV-LC7 


107 


3 


Ml 


20 


21,3% 


8 


HIV-LC22 


107 


3 


Ml 


21 


22.3% 


8 


HIV-LC13 


107 


3 


Ml 


21 


22.3% 


8 



SUBSnrUTE sheet (rule 26) 



wo 97/08320 PCT/EP9fi/03647 



Table 2A: (continued) 



Name' 


aa' 


Computed 
family^ 


Germline 
gene* 


Diff. to 
germline* 


% diff. to 
germline^ 


Referena 


HIV-LC3 


107 




A97 




22,3% 


8 


HIV-LC5 


107 


3 




91 
z 1 


22 30/0 


8 


HIV-LC28 


107 


3 


Az/ 


91 
Z 1 


77 30/0 


8 


HIV-b4 


107 


3 




99 
ZZ 


91 40/0 

ZO|t /u 


8 


CLL PATIENT 31 


87 


3 






1 7,2% 


122 


HIV-loop2 


108 


3 


10/1 1 c 


1 7 


1 7,9% 


8 


HIV-loop35 


108 


3 


1 O/l 1 c 


1 7 


17 QQ/b 


8 


HIV-LCl 1 


107 


3 


A 0*7 

A27 


90 
ZJ 


94. Wo 


8 


HIV-LC24 


107 


3 


ATT 


90 
Z J 


94 50/0 


8 


HlV-bl2 


107 


3 


A27 


Z*J 


9R RO/n 

Z J,3 /U 


8 


HIV-LC25 


107 


3 


A27 


OA 
Z4 


9^; ^cVft 


8 


HlV-b21 


107 


3 


A27 


OVI 

Z4 


91; c:Q/n 


8 


HIV-LC26 


107 


3 


A 0 "7 

A27 


oc 
Zb 


97 70/n 


8 


G3D10K 


108 


1 


112(2) 


1 0 
1 z 


1 Z,D '0 


125 


ni25 


108 


1 


Lb 


Q 


R AO/n 


63 


HIV-S2 


103 


3 


A27 


OQ 
ZO 


01 10/n 
01,1 *vO 


8 


265-695 


108 


1 


1 r 

L5 


•7 


7 AO/n 


0 


2-115-19 


108 


1 


A Oft 

A30 


z 


9 10/a 
Z, 1 "/O 


119 


rsvl3L 


107 


1 


02 


ZO 


91 10/n 


7 
/ 


HlV-bl8 


lOG 


1 


no 


1 *f 


1 ^ 10/n 


8 


RF-KL5 


98 


3 


L6 


OC 


OK 70/a 




ZM1-1 


113 


2 


A 1 "7 
Al / 


7 




3 


HIV-S8 


103 


1 




1 R 
1 D 


17 SCVo 


8 


K- EVIS 




c 


DZ 


0 


0,0% 


112 


or TC n 

RF-TSj 


inn 




A23 


0 


0.0% 


121 


HF-21/28 


111 


9 


A17 


1 


l.OO/o 


17 


RPIvll6410 




z. 


A17 


1 


1.0% 


42 


JCll 


113 


2 


A17 


1 


1,0% 


49 


U-o 1 


114 


2 


A17 


5 


5,0% 


45 


FK-001 


113 


4 


B3 


0 


0,0% 


81 


CD5+.28 


101 


4 


B3 


1 


1.0% 


27 


LEN 


114 


4 


83 


1 


1,0% 


104 


UC 


114 


4 


83 


1 


1.00/0 


111 


CD5+.5 


101 


4 


83 


1 


1.0% 


27 



SUBSTITUTE SHEET (RULE 26) 



wo 97/08320 
Table 2A: (continued) 



PCT/EP9d/03647 



Name' 




Computed 
family' 


Germline 
gene* 


UliT. 10 
germline* 


Qfri riiff tn 

/U Ul 1 1 ■ Ivl 

germline' 


Reference 


CD5+.26 


101 


4 


B3 


1 


l.Oo/o 


27 


CD5+.12 


101 


4 


B3 


2 


2.0% 


27 


CD5+.23 


101 


4 


B3 


2 


2.0% 


27 


CD5+.7 


101 


4 


83 


2 


2,0«yo 


27 


VJI 


113 


4 


83 


3 


3.0% 


56 


LOC 


113 


4 


83 


3 


3,00/0 


72 


MAL 


113 


4 


83 


3 


3,0% 


72 


CD5+.G 


101 


4 


83 


3 


3.0% 


27 


H2F 


113 


4 


83 


3 


3,00/0 


70 


PB17IV 


114 


4 


83 


4 


4.00/0 


74 


CD5+.27 


101 


4 


83 


4 


4,00/0 


27 


C05+.9 


101 


4 


83 


4 


4,0% 


27 


CD5-.28 


101 


4 


83 


5 


5.0% 


27 


CD5-.26 


101 


4 


83 


6 


5,9% 


27 


CD5+.24 


101 


4 


83 


6 


5.90/0 


27 


CD5+.10 


101 


4 


83 


6 


5.9% 


27 


CD 5-. 19 


101 


4 


83 


6 


5.90fe 


27 


CDS-. 18 


101 


4 


63 


7 


6.90/0 


27 


CD5-.16 


101 


4 


83 


8 


7.9% 


27 


C05-.24 


101 


4 


83 


8 


7.90/0 


27 


CD5-.17 


101 


4 


83 


10 


9.90/0 


27 


MD4.i 


92 


4 


83 


0 


0.0% 


54 


MD4.4 


92 


4 


83 


0 


0.0% 


54 


MD4.5 


92 


4 


DJ 




00% 


54 


MD4.G 


92 


4 


83 


0 


0.0% 


54 


MD4.7 


92 


4 


83 


0 


O.OO/b 


54 


MD4.2 


92 


4 


83 


1 


1,30/0 


54 


MD4.3 


92 


4 


83 


5 


6,3% 


54 


CLL PATIENT 22 


87 


2 


A17 


2 


2,3% 


122 


CLL PATIENT 23 


84 


2 


A17 


2 


2.4% 


122 



SUBSTITUTE SHEET (RULE 26) 



wo 97/08320 

Table 2B: rearranged human lambda sequences 



PCT/EP9fi/03647 



Name' 

WAH 
1B9/F2 
DIA 
mAb67 
HiH2 
NIG-77 
OKA 
KOL 
T2:C5 
T2:C14 
PR-TSl 
4G12 
KIM4GL 
Fog-B 
9F2L 
nrtAbl 1 1 
PH0X15 
BL2 

NIG-64 

RF-SJ2 . 

ALEZI 

ZIM 

RF-SJ1 

IGLV1.1 

NEW 

CB-201 

MEM 

H210 

NOV 

NEI 

AL MC 

MES 

FOG 1 -A3 
AL NOV 



aa' Computed 
family^ 

110 
112 
112 
89 
110 
112 
112 
112 
111 
110 
110 
in 
112 
111 
111 
110 
111 
111 
111 
100 
112 
112 
100 
98 
112 
87 
109 

111 2 

110 2 

111 2 

110 2 

112 2 

111 2 

112 2 



Germline Diff. to 



gene 

DPL3 
DPL3 
DPL2 
DPL3 
DPL3 
DPL2 
DPL2 
DPL2 
DPL5 
DPL5 
DPL5 
DPL5 
HUMLV117 
DPL5 
DPL5 
DPL5 
DPL5 
DPL5 
DPL5 
DPL5 
DPL5 
HUMLV117 
DPL5 
DPL4 
HUMLV117 
DPL2 
DPL2 
DPLIO 
DPLIO 
DPLIO 
DPL11 
DPL11 
DPL11 
DPL11 



0/0 diff. to Reference' 
germline* germline* 



7 

7 

7 

0 

12 

9 

7 

12 
0 
0 
0 
1 

0 
3 
3 
3 
4 
4 
4 
6 
7 
7 
9 
0 
11 
1 

6 
4 
8 
8 
6 
8 
9 
7 



70/0 
7% 
7% 
0% 
11% 
9% 
7% 
11% 
0% 
0% 
0% 
1% 
0% 
3% 
3% 
3% 
4% 
4% 
4% 
6% 
7Wo 
7% 
9% 
00/0 
10% 
10/0 
6% 
40/0 
8% 
8% 
6% 
8% 
9% 
70/0 



68 
9 

36 
29 
3 

72 
84 
40 

6 

6 

55 
35 
8 
31 
79 
48 
49 
74 
72 
78 
41 
18 
78 
1 

42 

62 
50 
45 
25 
24 
28 
84 
27 
28 



SUBSTITUTE SHEET (RULE 26) 
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Table 2B: (continued) 



PCT/EP96/03647 



Name' 

HMST-1 
HBW4-1 
WH 
11-50 
HBp2 
NlG-84 
VIL 
TRO 
ES492 
mAb216 
BSA3 
THY-29 
PR-TS2 

E29.1 LAMBDA 

mAb63 

TEL14 



aa' Computed 
family' 



110 

108 

110 

110 

110 

113 

112 

111 

108 

89 

109 

110 

108 

107 

109 

110 



6H-3C4 


108 


SH 


109 


AL GIL 


109 


H6-3C4 


108 


V-lambda-2.DS 


111 


8.12 ID 


110 


DSC 


in 


PVll 


no 


33.H11 


110 


AS17 


111 


SD6 


no 


KS3 


no 


PV6 


no 


NGD9 


no 


MUCl-1 


in 


A30c 


111 


KS6 


no 


TEL13 


in 



2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 



Germline Diff. to 
, germline 
gene ^ 



DPLII 
DPL12 
DPLII 
DPLlI 
DPL12 
DPLII 
DPLII 
DPL12 
DPLII 
DPL12 
DPL1B 
DPLIG 
DPL16 
DPL16 
DPL16 
0PL16 
DPL16 
DPL16 
DPL16 
DPL16 
DPLII 
DPLII 
DPLII 
DPLII 
DPLII 
DPLII 
DPLII 
DPLII 
DPL12 
DPLII 
DPLII 
DPLIO 
DPL12 
DPLII 
^5 



4 
9 
11 
7 
8 
12 
9 
10 
15 
1 

0 

0 

0 

1 

2 

6 

7 

7 

8 

8 

3 

3 

3 

1 



4 
7 
7 
9 
5 
7 

11 
6 
6 
11 



0/0 diff. to 
germline^ 

4% 
9% 
11% 
7% 
80/0 
11% 
9% 
10<Vo 
15Wo 
1% 
0% 
- 0% 
0% 
1% 
2<Vo 
6% 
7% 
70/0 
8% 
8% 
30/0 
30/0 
3% 
1% 
40/b 
70& 
70/b 
9% 
5% 
7% 
100/0 
6% 
6O/0 
10% 



Reference' 



82 
52 
34 
82 
3 

73 
58 
61 
76 
7 

49 

27 

55 

13 

29 

49 

39 

70 

23 

83 

15 

81 

56 

56 

81 

56 

56 

56 

56 

56 

27 

56 

56 

49 
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Table 2B: (continued) 



PCT/EP96/03647 



Name' 




Computed 
family^ 


Germline 
gene* 


Diff. to 
germline* 


% diff. to 

germiinc 


Referenc« 


AS7 


1 10 


z 


npi 19 


6 


6% 


56 
20 


MCG 


ill 

112 


I 


npi 1? 

ur L 1 c 


12 


11% 


U2G6L 


110 


2 


npi 1 0 
UrL 1 / 


13 


12% 


77 


PR-SJ2 


110 


2 


npi 10 

UrL 1 z 


14 


13% 


55 


BOH 


1 12 


o 

I 


UrL 1 z 


1 1 


10% 


37 


TOG 


1 1 1 


I 


npi 1 1 

Ur L 1 1 


19 


18% 


53 


TEL 16 


111 


2 


not 1 1 
UrLl 1 


1Q 


18% 


49 


No. 13 


110 


2 


npi in 


14 


13% 


52 


BO 


112 


2 


DrL12 


1 ft 
1 o 


17Qfe 


80 


WIN 


112 


2 


DrLl2 




16% 


11 


BUR 


104 


2 


DPL12 


1 5 


1 D"/0 


46 


NIG-58 


no 


2 


DPL12 


zU 




69 


WEIR 


112 


2 


not 1 1 
UrLl 1 


zo 


25% 


21 


THY-32 


111 




DPLB 


Q 

o 


RQ/h 

O fU 


27 


TNF-H9G1 


111 




npi ft 

UrLO 


9 


9<Vb 


27 


mAb61 


1 1 1 




nPi 1. 
ur lj 


1 


1% 


29 


LVILI 


no 






0 


0% 


54 


HA 


111 
1 1 J 




DPL3 


14 


13% 


63 


LAI LI 


tit 
1 1 1 






3 


3% 


54 


RHE 


112 




nPi 1 

UrL 1 


17 


16% 


22 


K1B12L 


113 




npi ft 

Ur LO 


17 


16% 


79 


LOC 


113 




r\Di o 
UrLz 


1 ^ 


14% 


84 


NIG-51 


112 




UrLz 


19 


11% 


67 


NEWM 


104 




nPl R 


23 


22% 


10 


MD3-4 


106 




nPI 91 


14 


13% 


4 


COX 


112 




DPI 9 

L^r LZ 


13 


12% 


84 


HiHiO 


1 nc 
1 Uo 




DPL23 


13 


12% 


3 


VUn 


112 




DPL2 


16 


15% 


16 


AL POL 


113 




0PL2- 


16 


15% 


57 


CD4-74 






DPL2 


19 


18% 


27 


AMYLOID MOL 


102 


3 


OPL23 


15 


15% 


30 


OST577 


108 


3 


Humlv318 


10 


10% 


4 


NlG-48 


113 


1 


DPL3 


42 


40% 


66 


CARR 


108 


3 


DPL23 


18 


17% 


19 
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Table 2B: (continued) 



Name 

mAb60 

NlG-68 

KERN 

ANT 

LEE 

CLE 

VL8 

MOT 

GAR 

32.B9 

PUG 

Tl 

RF-TS7 

YM-1 

K6HG 

K5C7 

K5B8 

K5G5 

K4B8 

K6F5 

HIL 

KIR 

CAP 
188 

SHO 

HAN 

cML23 

PR-SJ1 

BAU 

TEX 

X(PET) 

DOY 

COT 

Pag-1 



aa' 



Computed 
family' 



108 

99 
107 

106 

110 

94 

98 

110 

108 

98 

108 

115 

96 

116" 

112 

112 

112 

112 

112 

112 

108 

109 

109 

110 

108 

108 

96 

96 

107 

99 

107 

106 

106 

111 



3 

3 

3 

3 

3 

3 

8 

3 

3 

8 

3 

8 

7 

8 

8 

8 

8 

8 

8 

8 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 



Germline 
gene* 

DPL23 
DPL23 
DPL23 
DPL23 
DPL23 
DPL23 
DPL21 
Humlv318 
DPL23 
DPL21 
Humlv3l8 
HUMLV801 

DPL18 
HUMLV801 
HUMLV801 
HUMLV801 
HUMLV801 
HUMLV801 
HUMLV801 
HUMLV801 
DPL23 
DPL23 
OPL23 
DPL23 
DPL23 
DPL23 
DPL23 
DPL23 
DPL23 
DPL23 
DPL23 
OPL23 
DPL23 
Humlv318 



Diff. to 
germline' 

TT 

25 
26 
17 
18 
17 
0 
23 
26 
5 

24 
52 
4 
51 
20 
20 
20 
20 
19 
17 
22 
20 
19 
22 
19 
20 
3 
7 
9 
8 
9 
9 
13 
5 



% diff. to 
germline* 

13% 
26% 
25% 
16% 
17% 
17% 
0% 
22% 
25% 
5% 
23% 
50<yo 
4% 
49% 
19% 
19% 
19% 
19% 
18% 
16% 
21% 
19% 
18% 
21% 
18<Vo 
19% 
3% 
7% 
9% 
8% 
9% 
9% 
12% 
5% 



Reference' 

li" 
32 
59 
19 
85 
19 
81 
38 
33 
81 
19 
6 

60 
75 
44 
44 
44 
44 
44 
44 
47 
19 
84 
43 
19 
. 19 
12 
55 
5 

19 
51 
19 
19 
31 



SUBSTITUTE SHEET (RULE 25) 



wo 97/08320 
Table 2B: (continued) 
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Name' 




Computed 
family^ 


Germline 
gene* 


Diff. to % diff. to 
germline* germline" 


Reference' 




in? 
iU/ 




nui iiiv*} 1 0 


2 


2% 


19 


lAf IT 

WIT 


lUo 




numiv o 1 o 


7 


7% 


19 


I.RH 


1 f\Q 
lUo 


o 


t-liimlv/')1 Q 
nUmiVJ 1 o 


12 


11% 


19 


Sl-1 


1 HQ 


O 


nuniivo 1 0 


12 


11% 


52 


DEL 


108 


3 


Humlv318 


14 


13<Vb 


17 


TYR 


108 


3 


Humlv318 


11 


10% 


19 


J.RH 


109 


3 


Humlv318 


13 


12% 


19 


THO 


112 


2 


DPL13 


38 


36% 


26 


LBV 


113 


1 


DPL3 


38 


36% 


2 


WLT 


112 


1 


DPL3 


33 


31% 


14 


SUT 


112 


2 


DPL12 


37 


35% 


65 
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Table 2C: rearranged human heavy chain sequences 
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Name* 



aa' Computed 
family^ 


Germline 
gene^ 


Oiff. to "Vo diff. to 
germline* germline'' 


riefercriLC 


1 1Q 1 


VHl-13-12 


0 


O.OO/b 


31 




\/Hl-13-l2 


0 


0.0% 


31 


1 1 Q 1 


VHl-13-6 


4 


4.1% 


42 


QQ 1 


VHl-13-12 


10 


10.2% 


75 


QD 1 


VHl-13-6 


2 


2.00/0 


26 


QD 1 


VH1-13-G 


0 


0.00/0 


81 




VHl-13-6 


3 


3,1 OA) 


96 


lUb ' 


VHl-13-15 

VIII lu ■••r 


1 


1.00/0 


26 




VH1-13-G 


7 


7.1% 


81 


m 1 


\/Hl-13-6 


5 


5.^o^ 


115 


1 OO 1 


\/Hl-13-6 


6 


6.10/0 


55 


1 0O 1 

1 zZ » 


VH1-13-G 


10 


10,20/0 


70 


111 1 


VHl-13-16 

VIII 1 »/ IV 


11 


11.20/0 


77 


1 ZD * 


VHl-13-6 


10 


10,20/0 


71 


1 ZD 1 


VHl-12-7 


5 


5.10/0 


98 


1 Z3 * 


VHl-12-7 


5 


5.1% 


98 


1 Z^ ' 


VHl-12-7 


5 


5.1% 


98 


1 z u < 


VHl-12-7 


5 


5,1% 


98 


1 ZO 1 


VHl-12-7 


5 


5.10/0 


98 


191; 1 

1 ZO 1 


VHl-12-7 


5 


5,10/0 


98 


IOC 1 
IZD 1 


VHl-12-7 


6 


6.1% 


98 


lie 1 
IZD 1 


\/Hl-19-7 
V n 1 1 1 ' 


5 


5.10/0 


98 


125 1 


V n 1 - 1 z / 


6 


6.1% 


113 


iUd I 


VHl-12-7 

VIII 1 ' 


6 


6.10/0 


113 


1 ZD 1 


VHl-12-7 


6 


6.10/0 


113 


191; 1 
IZD 1 


VHl-12-7 


6 


6.1% 


113 


1 9R 1 

IZD ■ 


VHl-12-7 


6 


6.10/0 


113 


195 1 


VHl-12-7 


7 


7,10/0 


113 


122 1 


VHl-12-7 


0 


0.0% 


42 


122 1 


VHl-12-7 


10 


10.20/0 


21 


122 1 


VHl-12-1 


0 


0.00/0 


105 


127 1 


VHl-12-1 


0 


0.00/0 


55 


127 1 


VHl-12-1 


0 


0,0% 


55 


127 1 


VHl-12-1 


0 


0,0% 


54 


124 1 


VHl-12-1 


0 


0,00/0 


54 


127 1 


VHl-12-1 


0 


0.00/0 


54 



21/28 
8E10 
MUCl-1 
gFl 

VHGL 1.2 

HV1L1 

RF-TS7 

E55 1.A15 

HAILI 

UC 

WIL2 

R3.5H5G 

N89P2 . 

mAbn3 

LS2S3-3 

LS2S3-12a 

LS2S3-5 

LS2S3-12e 

LS2S3-4 

LS2S3-10 

LS2S3-12d 

LS2S3-8 

LS2 

LS4 

1^5 

LSI 

LS6 

LS8 

THY-29 

1B9/F2 

51P1 

NEI 

AND 

L7 

L22 

L24 
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i — " 

Name 


dd 


PnmnLitPd 

fsmily** 


Germline 
gene* 


Diff. to 
germline* 


% dIff. to 
germline* 


Reference' 


L26 


116 


1 


VHl-12-1 


0 


o.o<yo 


54 


L33 


119 


1 


VH 1-1 2-1 


0 


0.0% 


54 


L34 


117 


1 


VHl-12-1 


0 


0.0% 


54 


L36 


118 


1 


VHl-12-1 


0 


0.0% 


54 


L39 


120 


1 


VHl-12-1 


0 


0.0<Vo 


54 


L41 


120 




VHl-12-1 


0 


0.0% 


54 


L42 


125 


1 


VHl-12-1 


0 


o.o«yo 


54 


VHGL 1.8 


101 




VHl-12-1 


0 


0.0% 


26 


783c 


127 


1 


VHl-12-1 


0 


0.0% 


22 


X17n5 


127 


1 


VHl-12-1 


0 


0.0% 


37 


L25 


124 


1 


VHl-12-1 


0 


0.0% 


54 


L17 


120 




VHl-12-1 


1 


I.OWo 


54 


L30 


127 


1 


VHl-12-1 


1 


1.0<Vo 


54 


L37 


120 


1 


VHl-12-1 


1 


l.OO/o 


54 


TNF-E7 


116 


1 


VHl-12-1 


2 


2,o<yo 


42 


mAbl 1 1 


122 


1 


VHl-12-1 


7 


7.1% 


71 


III-2R 


122 




VHl-12-9 


3 


3.1% 


70 


KAS 


121 




VHl-12-1 


7 


7.1% 


79 


YES8C 


122 


1 


VHl-12-1 


8 


8,2% 


34 


RF-TSI 


123 


1 


VHl-12-1 


8 


8,2% 


82 


BOR' 


121 


1 


VHl-12-8 


7 


7.1% 


79 


VHGL 1.9 


101 


1 


• VHl-12-1 


8 


8,2% 


26 


mAb410.30F305 


117 


1 


VH 1-1 2-9 


5 


5.1% 


52 


EV1-15 


127 


1 


VHl-12-8 


10 


10,2% 


78 


mAbl 12 


122 


1 


VHl-12-1 


11 


11,20A) 


71 


EU 


117 


1 


VHl-12-1 


11 


11.2% 


28 


H210 


127 


1 


VHl-12-1 


12 


12,2% 


66 


TRANSGENE 


104 


1 


VHl-12-1 


0 


0.0% 


111 


CLL2-1 


93 




VHl-12-1 


0 


O.OWo 


30 


CLL10 13-3 


97 




VHl-12-1 


0 


0.0% 


29 


LS7 


99 




VHl-12-7 


4 


4,10/0 


113 


ALL7-1 


87 




VHl-12-7 


0 


0,0% 


30 


CLL3-1 


91 




VHl-12-7 


1 


1,0% 


30 


ALL56-1 


85 




VHl-13-8 


0 


0,0% 


30 


ALL1-1 


87 




VHl-13-6 


1 


1,0% 


30 


ALL4-1 


94 




VHl-13-8 


0 


0,OWo 


30 



7^ 



SUBSTITUTE SHEET (RULE 26) 



wo 97/08320 . 
Table 2C: (continued) 



PCT/EP96/03647 



Name' 


aa' 


Computed 
family' 


Germline 
gene* 


Diff. to o/o diff. to 
germline^ germline^ 


Reference' 


ALL56 15-4 


85 




t /LI t 1 O O 

vHl-13-8 


5 


5.1% 


79 


CLL4-1 


88 




VHl-13-1 


1 


1,0% 


30 


Au52.1 


98 




VH1-12-5 


0 


0.0% 


*T J 


RF-TS3 


120 




VHl-12-5 


1 


1,0% 


Q9 
oZ 


Au4.1 


98 




VHl-12-5 


1 


1,0% 


•f3 


HPl 


121 




VH1-13-G 


13 


13,3% 


1 in 


BLI 


127 




VHl-13-15 




5.1% 


77 
/ z 


No.13 


127 




VHl-12-2 


19 


1 9,4% 


/ D 


TR1.23 


122 




VHl-13-2 


23 


23,5% 


00 


Sl-1 


125 




VHl-12-2 


18 


18,4% 


/ D 


TR1.10 


119 




VHl-13-12 


14 


1 4,3% 


00 


E55 1.A2 


102 




VHl-13-15 




3.1% 


ZD 


SP2 


119 




VHl-13-6 


1 Q 

1 *J 


1 5,3% 


©y 


TNF-H9G1 


111 




VHl-13-18 


z. 


2.0% 


4Z 


G3D10H 


127 




VHl-13-16 




1 9,4% 


1 Z / 


TR1.9 


118 




VHl-13-12 


14 


14,3% 


00 


TR1.8 


121 




VHl-12-1 


z*t 


24,5% 


QQ 
00 


LUNmOl 


127 




VH 1-1 3-6 




22 4% 


Q 


K1B12H 


127 




VHl-12-7 


71 


23.5% 


1 77 
1 Z / 


L3B2 


99 




VH1-13-G 




2.0% 




ss2 


100 




VH1-13-G 




2.0% 


4d 


No.86 


124 




VHl-12-1 


70 


20,4% 


7b 


TR1.6 


124 




VHl-12-1 


1Q 


1 9,4% 


00 
00 


SS7 


99 




VHl-12-7 


3 


3.1% 


A C 

4b 


S5B7 


102 




VHl-12-1 


0 


0,0% 


4b 


S6A3 


97 




VHl-12-1 


0 


0,0% 


AO. 


SS6 


99 




VHl-12-1 


0 


0.0% 


Ho 


L2H7 


103 




VHl-13-12 


0 


0.0% 


HO 


S6BG8 


93 




Vn 1-1 J-lz 


0 


0,0% 


46 


s6C9 


107 




Vn 1-1 J-lz 


0 


0.0% 




HIV-b4 


124 




VHl-13-12 


21 


21,40/0 


12 


HIV-b12 


124 




VHl-13-12 


21 


21,40/0 


12 


L3G5 


98 




VHl-13-6 


1 


1.00/0 


46 


22 


115 




VHl-13-6 


1 1 


11,20/0 


118 


L2A12 


99 




VHl-13-15 


3 


3.10/0 


46 


PH0X15 


124 




VHl-12-7 


20 


20,40/0 


73 



^1 
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■ 1 

Name 


aa' Computed 
family^ 


Germline 
gene* 


Diff. to °/o diff. to 
germline' germline* 


Referenci 


LUNm03 


127 


1 


VH1-1X-1 


1 O 
1 O 


1 Q AOIn 


n 
3 


CEA4-8A 


129 


1 


VH1-12-7 


1 


I.U'TO 


42 


M60 


121 


2 


VH2-31-3 


3 


J,0"«) 


lUJ 


HiHlO 


127 


2 


VH2-31-5 


9 


9,00/0 


A 

H 


COR 


119 


2 


VH2-31-2 


n 


1 1 ,0% 


91 


2-115-19 


124 


2 


VH2-31-11 


8 


8.1"/o 


124 


OU 


125 


2 


VH2-31-14 


20 


25,6% 


92 


HE 


120 


2 


VH2-31-13 


19 


1 9,0% 


27 


CLL33 40-1 


78 


2 


VH2-31-5 


2 


2.0% 


29 


E55 3.9 


88 


3 


VH3-11-5 


7 


7,2% 


26 


MTFC3 


125 


3 


VH3-14-4 


21 


21,0% 


131 


MTFC1 1 


125 


3 


VH3-14-4 


21 


2 1 ,0% 


131 


MTFJl 


114 


3 


VH3-14-4 


21 


21,0% 


131 


MTFJ2 


114 


3 


VH3-14-4 


21 


21,0% 


131 


MTFUJ4 


100 


3 


VH3-14-4 


21 


2 1 ,0% 


131 


MTFUJ5 


100 


3 


VH3-14-4 


21 


2 1 ,uMro 


131 


MTFUJ2 


100 


3 


VH3-14-4 


22 


22,0% 


131 


MTFC8 


125 


3 


VH3-14-4 


23 


23.0% 


131 


TDeVq 


113 


3 


VH3-14-4 


0 


U,0% 


16 


rMTF 


114 


3 


VH3-14-4 


5 


5.0% 


131 


MTFUJ6 


100 


3 


VH3-14-4 


10 


1 0,0% 


131 


RF-KES 


107 


3 


• VH3-14-4 


9 




85 


N51P8 


126 


3 


VH3-14-1 


9 




77 


TEI 


119 


3 


VH3-13-8 


O 1 
Z 1 


9 1 AO/n 


20 


33.H11 


115 


3 


VH3-13-19 


1 n 
lU 


1 n 90*^ 

I U,Z A3 


129 


SB1/D8 


101 


3 


VH3-1X-8 


1 A 


1 *ft\J IsJ 


2 


38P1 


119 


3 


VH3-11-3 


u 




104 


BRO'IGM 


119 


3 


VH3-11-3 


13 


13.4% 


19 


NIE 


119 


3 


VH3-13-7 


15 


15,3% 


87 


306 


126 


3 


VH3- 13-26 


5 


5,1% 




ZMl-1 


112 


3 


VH3-11-3 


8 


8.2% 


5 


E55 3.15 


110 


3 


VH3- 13-26 


0 


0,0% 


26 


gF9 


108 


3 


VH3-13-8 


15 


15.30/0 


75 


THY-32 


120 


3 


VH3- 13-26 


3 


3.1% 


42 


RF-KL5 


100. 


3 


VH3- 13-26 


5 


5.1% 


96 


OST577 


122 


3 


VH3-13-13 


6 


6.1% . 


5 



7^2- 
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Name 






Germline 


Diff. to 


o/o diff, to 


Referenc< 




1 diiiiiy 


gene^ 


germline* 


germline* 




BO 


113 


3 


VH3-13-19 


15 


15,3% 


lO 


TT125 


121 


3 


VH3-13-10 


15 


15,3% 


64 


2-115-58 


127 


3 


\/H3-13-10 


11 


11.2% 


124 


KOL 


126 


3 


VH3-13-14 


16 


16.3% 


102 


mAbGO 


118 


3 


VH3-13-17 


14 


14,3% 


45 


RF-AN 


106 


3 


VH3-13-26 


8 


8.2% 


85 


BUT 


115 


3 


VH3-11-6 


13 


13.4% 


119 


KOL-based CAMPATH- 














9 


118 


3 


VH3-13-13 


16 


16.3%- 


41 


B1 


119 


3 


VH3-13-19 


13 


13.3% 


53 


N98P1 


127 


3 


VH3-13-1 


13 


13,30/0 


77 


nii7 


107 


3 


VH3-13-10 


12 


12,2% 


64 


WEA 


114 


3 


VH3-13-12 


15 


15.3% 


40 


HIL 


120 


3 


VH3-13-14 


14 


14.3% 


23 


S5A10 


97 


3 


VH3-13-14 


0 


0,0% 


46 


SSDII 


98 


3 


VH3-13-7 


0 


0,0% 


46 


sGCS 


100 


3 


VH3-13-7 


0 


0.0% 


46 


S6H12 


98 


3 


VH3-13-7 


0 


0.0% 


46 


VH10.7 


119 


3 


VH3-13-14 


16 


16.3% 


128 


HIV-loop2 


126 


3 


VH3-13-7 


16 


16,3% 


12 


HIV-loop35 


126 


3 


VH3-13-7 


16 


16,3% 


12 


TRO 


122 


3 


VH3-13-1 


13 


13,3% 


61 


SA-4B 


123 


3 


VH3-13-1 


15 


15,3% 


125 


L2B5 


98 


3 


VH3-13-13 


0 


0.0% 


46 


S6E11 


95 


3 


VH3-13-13 


0 


0,0% 


46 


s6H7 


100 


3 


VH3-13-13 


0 


0.0% 


46 


SSI 


102 


3 


VH3-13-13 


0 


0.0% 


46 


ss8 


94 


3 


VH3-13-13 


0 


0.0% 


46 


DOB 


120 


3 


VH3-13-26 


21 


21,40A) 


116 


THY-33 


115 


3 


VH3-13-15 


20 


20.4% 


42 


NOV 


118 


3 


VH3-13-19 


14 


14.J'V0 


JO 


rsvl3H 


120 


3 


VH3-13-24 


20 


20,4% 


11 


L3G11 


98 


3 


VH3-13-20 


2 


2.0% 


46 


L2E8 


99 


3 


VH3-13-19 


0 


0.0% 


46 


L2010 


101 


3 


VH3-13-10 


1 


1,0% 


46 


L2E7 


98 


3 


VH3-13-10 
73 


1 


1,0% 


46 
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Table 2C: (continued) 



Name' 



aa* Computed 
family' 



Germline Diff. to 

4 qermline* 
gene ^ 



o/o diff. to 
germline* 



Reference' 



m 3 VH3-13-24 0 0.0<yo 46 

L2E5 97 3 VH3-13-2 1 l.W" 

BUR 119 3 VH3-13-7 21 21.4% 67 

S4D5 107 3 VH3-11-3 1 hWo 46 

,9 116 3 VH3-13-16 4 4.1% 118 

S5D4 99 3 VH3-13-1 0 0.0% 46 

5eA8 100 3 VH3-13-1 0 0.0% 46 

HIV-I00P13 123 3 VH3-13-12 17 17.3% 12 

yR, 32 112 3 VH3-11-8 18 18.6% 88 

L2B10 97 3 VH3-11-3 1 1.0°* 46 

jR,5 114 3 VH3-11-8 21 21.6% 88 

S6H9 101 3 VH3-13-25 0 0.0% 46 

8 112 3 VH3-13-1 6 6.1% 118 

23 115 3 VH3-1.3-1 6 6.1% 118 

7 115 3 VH3-13-1 4 4.1% 118 

TR, 3 120 3 VH3-11-8 20 20.6% 88 

18/2 125 3 VH3-13-10 0 0,0% , 32 

18/9 125 3 VH3-13-10 0 0.0% 31 

30P1 lis 3 VH3-13-10 0 0.0% 106 

HF2-1/17 125 3 VH3-13-10 0 0.0% 8 

A77 109 3 VH3-13-10 0 0.0% 44 

B19.7 108 3 VHS-U-IO 0 0.0% 44 

119 3 VH3-13-10 0 0,0«yo 103 

l/,7 125 3 VH3-13-10 0 0.0«yo 31 

18/17 125 3 VH3-13-10 0 0.0% 31 

E54 3 4 109 3 VH3-13-10 0 0.0% 26 

LAMBDA-VH26 98 3 VH3-13-10 1 1.0% 95 

E54 3 8 111 3 VH3-13-10 1 1.0% 26 

GL16 106 3 VH3-13-10 1 1.0% 44 

4G12 125 3 VH3-13-10 1 1.0% 55 

A73 106 3 VH3-13-10 2 2.0% 44 

AL13 111 3 VH3-13-10 3 3.1% 117 

3A290 118 3 VH3-13-10 2 2.0% 108 

Abi8 127 3 VH3-13-8 2 2.0% 100 

E54 3.3 105 3 VH3-13-10 3 3.1% 26 

35G6 121 3 VH3-13-10 3 3.1% 57 



SUBSTITUTE SHEET (RULE 26) 



PCT/EP96/03647 



WO 97/08320 

Table 2C: (continued) 



Name' 



aa' Computed 
family' 



A95 

Ab25 

N87 

ED8.4 

RF-KL1 

AL1.1 

AL3.1 1 

32.B9 

TKl 

POP 

9F2H 

VD 

Vh38CI.10 

\/h38a.9 

Vh38Cl.8 

63P1 

60P2 

AL3.5 

GF4/1.1 

Ab21 

TDdVp 

V1138CI.4 

Vh38CI.5 

AL3.4 

FOG 1 -A3 

HA3D1 

E54 3.2 

mAb52 

mAb53 

mAb56 

mAb57 

mAb58 

mAb59 

mAblOB 

rnAblO? 

E55 3.14 



107 

128 

126 

99 

122 

112 

102 

127 

109 

123 

127 

115 

121 

121 

121 

120 

117 

90 

123 

12G 

118 

119 

119 

104 

115 

117 

112 

128 

128 

128 

128 

128 

128 

128 

128 

110 



3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 



Germline Diff. to 
gene* germiine 



0/0 diff. to Reference' 
germiine* 



VH3-13-10 
VH3-13-10 
VH3-13-10 
VH3-13-10 
VH3-13-10 
VH3-13-10 
VH3-13-10 
VH3-13-8 
\/H3-13-10 
VH3-13-10 
VH3-13-10 
VH3-13-10 
VH3-13-10 
VH3-13-10 
VH3-13-10 
VH3-11-8 
VH3-11-8 
VH3-13-10 
VH3-13-10 
VH3-13-10 
VH3-13-17 
VH3-13-10 
VH3-13-10 
VH3-13-10 
VH3-13-19 
VH3-13-21 
VH3-13-24 
VH3-13-12 
VH3-13-12 
VH3-13-12 
VH3-13-12 
VH3-13-12 
VH3-13-12 
VH3-13-12 
VH3-13-12 
VH3-13-19 



5 
5 
4 
6 
6 
2 
1 

6 
2 
8 
9 
9 
8 
8 
8 
0 
0 
•2 
10 
12 
2 
8 
8 
1 
2 
1 ' 
0 
2 
2 
2 
2 
2 
2 
2 
2 
0 



5.1% 

5.1% 

4.1% 

6.1% 

6.1% 

2.0% 

1.0<Vo 

6.1% 

2.0% 

8.2% 

9.2% 

9.2% 

8,2% 

8.2% 

8.2% 

0.0% 

0.0% 

2,0% 

10.2% 

12.2% 

2,0<yo 

8.2W0 

8.2<Vo 

1.0% 

2.0<yo 

}.m 

O.OO/o 

2,m 

2,0% 

2,0% 

2.0% 

2,0% 

2.0% 

2.0% 

2.0% 

0.0% 



44 
100 
77 
2 

82 
117 
117 
129— 
117 
115 
127 
10 
74 - 
74 
74 
104 
104 
117 
39 
100 
16 
74 
74 
117 
42 
81 
26 
51 
51 
51 
61 
51 
51 
51 
51 
26 



7^ 
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Table 2C: (continued) 



Name' 

F13-28 
mAbSS 
YSE 

E55 3.23 

RF-TS5 

N42P5 

F061-H6 

0-81 

HIV-S8 

mAblH 

33.F12 

4B4 

M26 , 

VHGL 3.1 

E55 3.13 

SB5/D6 

RAY4 

82-DV-D 

MAL 

LOC 

LSF2 

HIB RC3 

56P1 

M72 

M74 

E54 3.5 

2E7 

2P1 

RF-SJ2 

PR-TS1 

KIM46H 

£55 3.6 

E55 3.10 

3.B6 

E54 3.6 

FL2-2 



aa' Computed 
family^ 



Germiine Diff. to % diff. to 
gene* gerf^l'"^ germiine 



106 


3 


VH3-13-19 


1 


1.0% 


127 


3 


VH3-13-18 


4 


4.1% 


117 


3 


VH3-13-24 


6 


6.1% 


106 


3 


\/H3-13-l9 


2 


2,OWo 


101 


3 


VH3-13-1 


3 


3.1% 


124 


3 


VH3-13-2 


7 


7.1% 


110 


3 


\/H3-13-l6 


7 


7.1% 


115 


3 


VH3-13-19 


11 


11.2% 


122 


3 


VH3-13-12 


11 


11,2% 


125 


3 


VH3-13-19 


12 


12.2% 


116 


3 


VH3-13-2 


4 


4.1% 


119 


3 


VH3-1X-3 


0 


0.00/0 


123 


3 


VH3-1X-3 


0 


O.OO/o 


100 


3 


VH3-1X-3 


0 


O.OO/o 


113 


3 


VH3-1X-3 


1 


1.00/0 


101 


3 


VH3-1X-6 


3 


3.0% 


101 


3 


VH3-1X-6 


3 


3.00* 


106 


3 


VH3-1X-3 


5 


5.00/0 


129 


3 


VH3-1X-3 


5 


5.0% 


123 


3 


VH3-1X-6 


5 


5.0% 


101 


3 


VH3-1X-6 


11 


11.0% 


100 


3 


• \/H3-lX-6 


11 


11.00/0 


119 


3 


VH3-13-7 


0 


0.0% 


122 


3 


VH3-13-7 


0 


0.0% 


121 


3 


VH3-13-7 


0 


0.0% 


105 


3 


VH3-13-7 


0 


0,0% 


123 


3 


VH3-13-7 


0 


O.OWo 


117 


3 


VH3-13-7 


0 


0,00/0 


127 


3 


VH3-13-7 


1 


I.OWo 


114 


3 


VH3-13-7 


1 


I.OO/o 


127 


3 


VH3-13-13 


0 


O.OO/o 


108 


3 


VH3-13-7 


2 


2.00/0 


107 


3 


VH3-13-13 


1 


l.OO/o 


114 


3 


VH3-13-13 


1 


1.00/0 


110 


3 


VH3-13-13 


1 


I.OO/o 


114 


3 


VH3-13-13 


1 


I.OO/o 



94 

51 

72 

26 

85 

77 

42 

47 

12 

71 

129 

101 

103 

26 

26 

2 

2 

112 
72 
72 

2 

1 

104 

103 

103 

25 

63 

104 

83 

85 

18 

26 

26 

108 

26 

80 
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Table 2C: 
Name' 

RF-SJ3 

E55 3.5 

BSA3 

HMST-1 

RF-TS2 

ESS 3.12 

19.E7 

11-50 

E29.1 

ESS 3.16 

TNF-E1 

RF-SJl 

F0G1-A4 

TNF-A1 

PR-SJ2 

HN.14 

CAM' 

HIV-B8 

HIV-b27 

HlV-b8 

HIV-S4 

HIV-B26 

H1V-B35 

HIV-bl8 

HIV-b22 

HlV-bl3 

333 

IHI 

1B11 

CLL30 2-3 
GA 
JeB 
GAL 

K6H6 

K4B8 

K5B8 



(continued) 



aa' 



Computed 
family' 



Germline 
gene" 



Diff.to «%>diff.to Reference' 
germline* germline' 



112 
105 
121 
119 
126 
109 
126 
119 
120 
108 
117 
127 
116 
117 
107 
124 
121 
125 
125 
125 
125 
125 
125 
125 
125 
125 
117 
120 
120 
86 
110 
99 
110 
119 
119 
119 



3 

3 

3 

3 

3 

3 

3 

3 

3 
3 
3 
3 
3 
3 
3 
3 
3 
3 



3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 



VH3-13-7 
VH3-13-14 
VH3-13-13 
VH3-13-7 
VH3-13-13 
VH3-13-15 
VH3-13-14 
\/H3-13-13 
VH3-13-15 
VH3-13-7 
VH3-13-7 
\/H3-13-13 
VH3-13-7 
VH3-13-15 
VH3-13-14 
VH3-13-13 
VH3-13-7 
VH3-13-7 
VH3-13-7 
VH3-13-7 
VH3-13-7 
VH3-13-7 
VH3-13-7 
VH3-13-7 
VH3-13-7 
VH3-13-7 
VH3-14-4 
VH3-14-4 
VH3-14-4 
VH3-13-19 
VH3-13-7 
VH3-13-14 
VH3-13-19 
VH3-1X-6 
VH3-1X-6 
VH3-1X-6 



2 
1 
1 
3 
4 
0 
3 
6 

2 

6 

7 

6 

8 

4 

8 

10 

12 

9 

9 

9 

9 
9 

10 
10 
11 
12 
24 
24 
23 
1 

19 
3 

10 
18 
18 
18 



2.0'yo 

1.0% 

1,0% 

3,1% 

4,1% 

0.0% 

3.1% 

6,1% 

2,0«yo 

6,1% 

7,1% 

6.1% 

8.2% 

4.1% 

8,2% 

10,2% 

12,2% 

9,2% 
9,2% 
9.2% 
9.2% 
9.20/0 
10.2% 
10.2% 
11.2% 
12.2% 
24.0% 
24.0% 
23.0<yo 
l.OO/b 
19.4% 
3.1% 
10,2% 
I8.OW0 
18,0% 
18.0% 



85 
26 
73 
130 
82 
26 
129 
130 
25 
25 
42 
83 
42 
42 
85 
33 
65 
12 
12 
12 
12 
12 
12 
12 
.12 
12 
24 
24 
24 
29 
36 
7 

126 
60 
60 
60 
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Name' 

K5C7 
K5G5 
K6F5 
AL3.16 
N86P2 
N54P6 

LAMBDA HTl 12-1 

HY18 

mAb63 

FS-3 

FS-5 

FS-7 

FS-8 

PR-TS2 

RF-TMC 

mAb216 

mAb410.7.F91 

mAbA6H4C5 

Ab44 

6H-3C4 

F5-6 

FS-2 

HIG1 

F5-4 

SA-4A 

LES-C 

DI 

Ab26 
TS2 

265-695 

WAH 

268-D 

58P2 

mAb67 

4139 

mF7 



aa' Computed 
family^ 



119 

119 

119 

98 

98 

95 

126 

121 

126 

105 

111 

107 

110 

105 

102 

122 

122 

124 

127 

124 

108 

114 

126 

105 

123 

119 

78 

126 

124 

115 

129 

122 

118 

128 

115 

111 



3 

3 

3 

3 

3 

3 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 



Germline 
gene* 

VH3-1X-6 
VH3-1X-6 
VH3-1X-6 
VH3-13-10 
VH3- 13-10 
VH3-13-16 
VH4-11-2 
VH4-11-2 
VH4-11-2 
VH4-11-2 
VH4-11-2 
VH4-11-2 
VH4-11-2 
VH4-11-2 
VH4-11-2 
VH4-11-2 
VH4-11-2 
VH4-11-2 
VH4-11-2 
VH4-11-2 
VH4-11-2 
VH4-11-2 
VH4-11-2 
VH4-11-2 
VH4-11-2 
VH4-11-2 
VH4-11-9 
VH 4-3 1-4 
VH4-31-12 
VH4-11-7 
VH4-31-13 
VH4-11-8 
VH4-11-8 
VH4-21-4 
VH4-11-8 
VH4-31-13 



Diff. to 
germline' 

li" 

19 
19 
1 
3 
7 
0 
0 
0 
0 
0 
0 
0 
0 



0 

1 

1 

1 

2 

3 

6 

6 

7 

8 

9 

10 

16 

8 

15 
16 
19 
22 
0 
1 
2 
3 



% diff. to 
germline^ 

19.0% 
19.0% 
19.00/0 
1.0% 
3.1% 
7.1% 
0.0% 
0.0% 
0.0% 
O.OO/o 
0,0% 
0.0% 
0.0% 
0.00/0 
O.OO/o 
1.00/0 
^,Wo 
LOP/o 
2,1% 
3,1% 
6,2% 
6,2% 
7,2% 
8,2% 
9,3% 
10.3% 
16.5% 
8.1% 
15,2% 
16.50/0 
19,20/0 
22,70/0 

0. 00/0 

1. QO/o 
2,10/0 
3,00/0 



Reference' 

lo" 

60 
60 
117 
77 
77 
3 

43- 
45 
86 
86 
86 
86 
85 
85 
15 
52 
15 
100 
59 
86 
84 
62 
86 
125 
99 
58 
100 
110 
5 

93 
6 

104 
45 
108 
75 
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Name' 



Computed 
family' 



Germline Diff. to % diff. to 
gene* germline* germline' 



Reference' 



33.C9 


122 


4 


VH4-21-5 


7 


7,1% 


129 


Paq-1 


124 


4 


VH4-11-16 


5 


5.2% 


50 


B3 


123 


4 


VH4-21-3 


8 


8,2% 


.53 


IC4 


120 


4 


VH4-11-8 


6 


6.2% 


70 


C6B2 


127 


4 


VH4-31-12 


4 


4,0% 


48 


N78 


118 


4 


VH4-11-9 


11 


11.3% 


77 


B2 


109 


4 


VH4-11-8 


12 


12.4% 


53 


WRD2 


123 


4 


VH4-11-12 


6 


6.2% 


90 


mAb42G.4.2F20 


126 


4 


VH4-11-8 


2 


2.1% 


52 


E54 4.58 


115 


4 


VH4-11-8 


-1 


1.0% 


26 


WRD6 


123 


4 


VH4-11-12 


10 


10.3% 


90 


mAb426.12.3Fl.4 


122 


4 


VH4-11-9 


4 


4.1% 


52 


E54 4.2 


108 


4 


VH4-21-6 


2 


2.0% 


26 


WIL 


127 


4 


VH4-31-13 


0 


o.o<yo 


90 


COF 


126 


4 


\/H4-31-13 


0 


o,o<yo 


90 


LAR 


122 


4 


VH4-31-13 


2 


2.0% 


90 


WAT 


125 


4 


VH 4-31-13 


4 


A.Wo 


90 


mAb61 


123 


4 


VH4-31-13 


5 


5.10/d 


45 


WAG 


127 


4 


VH4-31-4 


0 


0,00/0 


90 


RF-SJ4 


108 


4 


VH4-31-12 


2 


2.0oyb 


85 


E54 4.4 


no 


4 


VH4-11-7 


0 


O.QPh 


26 


E55 4.A1 


108 


4 


VH4-11-7 


0 


O.QO/o 


26 


PR-SJ1 


103 


4 


VH4-11-7 


1 


\.Wo 


85 


E54 4.23 


111 


4 


VH4-11-7 


1 


1.00/0 


26 


CLL7 7-2 


97 


4 


VH4-n-12 


0 


O.QPh 


29 


37P1 


95 


4 


\/H4-11-12 


0 


0.00/0 


104 


ALL52 30-2 


91 


4 


VH4-31-12 


4 


4.00/b 


29 


rn\/ o 1 


98 


5 


VH5-12-1 


0 


O.OO/o 


13 


CB-4 


98 


5 


VH5-12-1 


0 


0,00/0 


13 


CLL-12 


98 


5 


VH5-12-1 


0 


0.0% 


13 


L3-4 


98 


5 


VH5-12-1 


0 


0,0% 


13 


can 


98 


5 


VH5-12-1 


0 


0.0% 


17 


C0RD3 


98 


5 


VH5-12-1 


0 


O.OWo 


17 


C0RD4 


98 


5 


VH5-12-1 


0 


0,00/0 


17 


CORDS 


98 


5 


VH5-12-1 


0 


O.OO/o 


17 


C0RD9 


98 


5 


VH5-12-1 


0 


O.OO/o 


17 
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■ i " 

Name 


CO 


familv^ 


Germlinc 
gene' 


Diff. to % diff. to 
germline* germline* 


Reference^ 


CD+1 


98 


5 


VH 5-12-1 


0 


0,0% 


17 
17 


CD+3 


98 


5 


VH5-12-1 


0 


0.0% 


CD+4 


98 


5 


VH5-12-1 


0 


o,o«yo 


17 


CD-1 


98 


5 


VH5-12-1 


0 


0,0% 


17 


CD-5 


98 


5 


VH5-12-1 


0 


0.0% 


17 


VERG14 


98 


5 


VH5-12-1 


0 


0,0% 


17 


PBL1 


98 


5 


VH5-12-1 


0 


0,0% 


17 


PBLIO 


98 


5 


VH5-12-1 


0 


0.0% 


17 


STRAbSA-lA 


127 


5 


VH5-12-1 


0 


00% 


125 


DOB" 


122 


5 


VH5-12-1 


0 


00% 


97 


VER65 


98 


5 


VH5-12-1 


0 


00% 


17 


PBL2 


98 


5 


VH5-12-1 




1.0% 


17 


Tul6 


119 


5 


VH5-1-2-1 


1 


1.00A) 


49 


PBL12 


98 


5 


VH5-12-1 


1 


1.0% 


17 


CD+2 


98 


5 


VH5-12-1 


1 


1.0% 


17 
17 


CORD 10 


98 


5 


VH5-12-1 


1 


1.0% 


PBL9 


98 


5 


VH5-12-1 




1.0% 


17 


C0RD2 


98 


5 


VH5-12-1 


2 


2,0% 


17 


PBL6 


98 


5 


VH5-12-1 


2 


2.0% 


17 


CORDS 


98 


5 


VH5-12-1 


2 


2.0VO 


17 


CD-2 


98 


5 


VH5-12-1 


2 


2.0% 


17 


C0RD1 


98 


5 


VH5-12-1 


2 


2,0% 


17 


CD-3 


98 


5 


VH5-12-1 


3 


3.1% 


17 


VERG4 


98 


5 


VH5-12-1 


3 


3,1% 


17 


PBL13 


98 


5 


VH 5-12-1 


3 


3,1% 


17 


PBL7 


98 


5 


VH5-12.1 


3 


3.1% 


17 


HAN 


119 


5 


VH5-12-1 


3 


3.1% 


97 


VERG3 


98 


5 


VH 5-12-1 


3 




17 


PBL3 


98 


5 


VH5-12-1 


3 


3,1% 


17 


VERG7 


98 


5 


VH5-12-1 


3 


3.1% 


17 


PBL5 


94 


5 


VH5-12-1 


0 


0,0% 


1 / 


CD-4 


98 


5 


VH5-12-1 


4 


4,10/0 


17 


CLL10 


98 


5 


VH5-12-1 


4 


4.1% 


17 


PBL11 


98 


5 


VH5-12-1 


4 


4.1% 


17 


C0RD6 


98 


5 


VH5-12-1 


.4 


4,1% 


17 


VERG2 


98 


5 


VHS-12-1 
9o 


5 


5,1% 


17 
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Table 2C: (continued) 



83P2 

VERG9 

CLL6 

PBL8 

Ab2022 

CAV 

HOW 

PET 

AN6 

KER 

5.M13 

Au2.1 

WS1 

TD Vn 

TEL13 

ESS 5.237 

VERG1 

CD4-74 

257-D 

CLL4 
CLL8 
Ab2 

Vh383ex 

CLL3 

Au59.1 

TEL16 

M61 

TuO 

P2-51 

P2-54 

Pl-56 

P2-53 

Pl-51 

Pl-54 

P3-G9 

P3-9 



aa' Computed 
family' 



Germline 
gene' 



Diff. to 
germline* 



% diff. to Reference' 
germline* 



119 

98 

98 

98 

120 

127 

120 

127 

121 

121 

118 

118 

126 

98 

lie 

112 

98 

117 

125 

98 

98 

124 

98 

98 

122 

117 

104 

99 

122 

122 

119 

122 

123 

123 

127 

119 



5 


VH5-12-1 


0 




103 


5 


VH5-12-1 


b 




17 


5 


VH 5-12-1 


b 




17 


5 


VH 5-12-1 


7 


7 lO/o 


17 


5 


VH5-12-1 


3 




100 


5 


VH5-12-4 


0 




97 


5 


VH5-12-4 


0 




97 


5 


VH5-12-4 


0 




97 


5 


VH5-12-4 


0 




97 


5 


VH5-12-4 


0 




97 


5 


VH5-12-4 


0 




107 


5 


VH5-12-4 


1 




49 


5 


VH5-12-1 


9 


Q 00/n 


no 


5 


VH5-12-4 


1 


1 no/n 


16 


5 


VH5-12-1 


9 




73 


5 


VH5-12-4 


2 




26 


5 


VH5-12-1 


10 


1 n 90/n 


17 


5 


VH5-12-1 


1 A 

10 


1 n 90/n 


42 


5 


VH5-12-1 


1 1 


1 1 2% 


6 


5 


VH5-12-1 


1 1 


1 1 2o/o 


17 


5 


\/H5-12-1 


1 1 


11.2% 


17 


5 


\/H5-l2-l 


1 1 

\ I. 


12.2% 


120 


5 


VH5-12-1 


1 i. 


12,2% 


120 


5 


VH5-12-2 


1 1 


11.2% 


17 


5 


VH5-12-1 


1 L 


12,2% 


49 


5 


VH5-12-1 




12,2% 


73 


5 


vn!)- 1^- 1 


0 


0.0% 


103 


5 


VH5-12-1 


5 


5.1% 


49 


5 


VH5-12-1 


13 


13.3% 


121 


5 


VH5-12-1 


11 


11.2% 


121 


5 


VH5-12-1 


9 


9.2% 


121 


5 


VH5-12-1 


10 


10.2% 


121 


5 


VH5-12-1 


19 


19.40/0 


121 


5 


VH5-12-1 


3 


3,1% 


121 


5 


VH5-12-1 


4 


4.1% 


121 


5 


VH5-12-1 


4 


4.1% 


121 



8 / 
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Name' 



aa' Computed 
family' 



Germiine 
gene' 



Diff. to 
germiine' 



% diff. to 
germiine* 



Reference' 



1-185-37 

1-187-29 

pi-58 

P2-57 

P2-55 

P2-56 

P2-52 

P3-60 

P1-57 

P1-55 

MD3-4 

Pl-52 

aL5 

CLL7 

L2F10 

L3BG 

VH6.A12 

S5A9 

s6G4 

ss3 

6-lGl 

F19L16 

LI 5 

M71 

ML1 

F19ML1 

15P1 

VH6.N1 

VHG.Nn 

VH6.N12 

VH6.N2 

VH6.N5 

VH6.N6 

VH6.N7 

VH6.N8 

VH6.N9 



125 

125 

128 

118 

123 

123 

122 

122 

123 

122 

128 

121 
98 
98 

100 

98 

119 

102 

99 

99 

101 

107 

120 

121 

120 

107 

127 

121 

123 

123 

125 

125 

127 

126 

123 

123 



5 

5 

5 

5 

5 

5 

5 

5 

5 

5 

5 

5 

5 

5 

5 

5 

6 

6 

6 

6 

6 

6 

6 

6 

6 

6 

6 

6 

6 

6 

6 

6 

6 

6 

6 

6 



VH5-12-4 
VH5-12-4 
VH5-12-4 
VH5-12-4 
VH5-12-1 
VH 5-12-1 
VH5-12-1 
VH5-12-1 
VH5-12-1 
VH5-12-1 
VH5-12-4 
VH5-12-1 
VH5-12-1 
VH5-12-1 
VH5-12-1 
VH5-12-1 
VH6-35-1 
VH6-35-1 
VH6-35-1 
VH6-35-1 
VH6-35-1 
■ VHG-35-1 
VH6-35-1 
VH6-35-1 
VH6-35-1 
VHG-35-1 
VHG-35-1 
VHG-35-1 
VHG-35-1 
VHG-35-1 
VHG-35-1 
VHG-35-1 
VHG-35-1 
VHG-35-1 
VHG-35-1 
VH6-35-1 

S2. 



0 

0 

10 

3 

5 

20 

n 

8 

4 

14 

12 

11 

13 

14 

1 

1 

13 
1 
1 
1 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 



O.OO/o 
0.0% 
10,2% 
3.1% 
5,1% 
20.4% 
11,2% 
8.2% 
4,1% 
14,3% 
12.2% 
11,2% 
13.3% 
14.3% 
1,0% 
1,0% 
12,9% 
1.0% 
1,0% 
1,0% 
0.0% 
0,0% 
0,0% 
0,00/0 
0,0% 
O.OO/o 
0,0% 
0,0% 
0,00/0 
0,00/0 
0.0% 
O.OO/o 
0,00/0 
0,00/0 
0,00/0 
0,00/0 



124 
124 
121 
121 
121 
121 
121 
121 
121 
121 
5 

121 
17 
17 
46 
46 
122 
46 
46 
46 
14 
68 
69 
103 
69 
68 
104 
122 
122 
122 
122 
122 
122 
122 
122 
122 
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Name^ 


aa' 


Computed 
family^ 


Germline 
gene* 


Diff. to % diff. to 
germline* germline* 


ncierencc 


VH6.N10 


123 


6 


VH6-35-1 


0 


0,0% 


122 


VH8.A3 


123 


6 


VH 6-35-1 


0 


0,0^0 


122 


VH6.A1 


124 


6 


VH6-35-1 


0 


O.Wo 


122 


VH6.A4 


120 


6 


VH 6-35-1 


0 


0.0% 


122 


ESS G.IG 


116 


6 


VH6-35-1 


0 


0.0% 


26 


ESS 6.17 


120 


6 


VH 6-35-1 


0 


O.OtVo 


26 


ESS 6.6 


120 


6 


VH 6-35-1 


0 


0.0% 


26 


VHGL 6.3 


102 


6 


VH 6-35-1 


0 


o.o<yo 


26 


CB-201 


118 


6 


VH 6-35-1 


0 


0,0<Vo 


109 


VHG.N4 


122 


6 


VH6-35-1 


0 


o.o<yo 


122 


FS4 6 4 


109 


6 


VH 6-3 5-1 


1 


i.o<yo 


26 


VH6.A6 


126 


6 


VH 6-35-1 


1 


i.o«yo' 


122 


ESS 6.14 


120 


6 


VH 6-35-1 


1 


i.o<yo 


26 


ES4 6.6 


107 


6 


VH 6-3 5-1 


1 


1.0% 


26 


ESS 6 10 


112 


6 


VH 6-35-1 


1 


i.Qtyo 


26 


ES4 6.1 


107 


6 


VH 6-35-1 


2 


2.o<yo 


26 


ESS 6 13 


120 


6 


VH 6-35-1 


2 


2.o<yo 


26 


ESS 6.3 


120 


6 


VH6-35-1 


2 


2.m 


26 




116 


6 


VH6-35-1 


2 


2,0% 


26 




120 


6 


VH6-35-1 


2 


2.O0/O 


26 


E55 6.x 


111 


6 


VH6-35-1 


2 


2.o<yo 


26 


E55 6.1 1 


111 


6 


VH6-35-1 


3 


3,o<yo 


26 


VH6.A11 


118 


6 


VH 6-35-1 


3 


3.o<yo 


122 


AlO 


107 


6 


VH6-35-1 


3 


3,0% 


68 


E55 6.1 


120 


6 


VH6-35-1 


4 


4.0% 


26 


FK-001 


124 


6 


VH6-35-1 


4 


4.0% 


65 


VH6.A5 


121 


6 


VH6-35-1 


4 


4,0% 


122 


VH6.A7 


123 


6 


VH6-35-1 


4 


4,0% 


122 


HBp2 


119 


6 


VH6-35-1 


4 


4.0% 


4 


Au46.2 


123 


6 


VH6-35-1 


5 


5,0% 


49 


A431 


106 


6 


VH6-35-1 


5 


5,0% 


68 


VHG.A2 


120 


6 


VH6-35-1 


5 


5.0% 


122 


VH6.A9 


125 


6 


VH6-35-1 


8 


7.9% 


122 


VH6.A8 


118 


6 


VH6-35-1 


10 


9.9% 


122 


VHG-FF3 


118 


6 


VH 6-3 5-1 


2 


2.00/0 


123 


VH6.A10 


126 


6 


VH 6-35-1 


12 


11.9% 


122 



83 

SUBSTITUTE SHEET (RULE 26) 



wo 97/08320 
Table 2C: (continued) 



PCT/EP96/03647 



Name 


2 


Computed 
family' 


Germline 
gene* 


Uilr. to 
germline* 


O/n Hiff tn 
ifO U 1 1 1 « lU 

germline* 


Rpfprpficp' 


VH6-EB10 


117 


6 


VH6-35-1 


3 


3.0% 


123 


VH6-E6 


119 


6 


VH6-35-1 


6 


5.9% 


123 


VH6-FE2 


121 


6 


\/H6-35-1 


6 


5.9% 


123 


VH6-EE6 


116 


6 


VH6-35-1 


6 


5.9% 


123 


VH6-FD10 


118 


6 


VHG-35-1 


6 


5,9% 


123 


VH6-EX8 


113 


6 


VH6-35-1 


6 


5.9% 


123 


VH6-FG9 


121 


6 


VH6-35-1 


_8 


7.9% 


123 


VH6-E5 


116 


6 


VH6-35-1 


9 


8.9% 


123 


VH6-EC8 


122 


6 


VHG-35-1 


9 


8.9% 


123 


VHG-EIO 


120 


6 


VH6-35-1 


10 


9.9% 


123 


VH6-FF1 1 


122 


6 


VH6-35-1 


11 


10.9% 


123 


VH6-FD2 


115 


6 


VH6-35-1 


n 


10.90/0 


123 


CLL10 17-2 


88 


6 


VH6-35-1 


4 


4.0% 


29 


VH6-BB1 1 


94 


6 


VH6-35-1 


4 


4.0% 


123 


VH6-B4I 


93 


6 


VHG-35-1 


7 


G.9% 


123 


JU17 


102 


6 


VHG-35-1 


3 


3.0<yo 


114 


VH6-BD9 


96 


6 


VH6-35-1 


11 


10.9% 


123 


VH6-BB9 


94 


6 


VH6-35-1 


12 


n.9% 


123 



S>4 
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Table 3A: assignment of rearranged V kappa sequences to their germline counterparts 



Family' 


Name 


Rearranged' 


Sum 


I 


Vicl-I 


28 




1 


Vlcl-2 


0 




1 


Vkl-3 


1 




I 


Vkl-4 


0 




1 


Vkl-5 


7 




1 


Vkl.6 


0 




1 


VkI-7 


0 




1 


Vkl-8 


2 




1 


Vkl-9 


9 




1 


Vkl-10 


0 




1 


Vkl-ll 


1 




I 


Vkl-12 


7 




1 


VkI-13 


1 




1 


Vkl-14 


7 




j 


Vkl-I5 


2 




1 


Vkl-16 


2 




I 


Vkl-17 


16 




1 


Vkl-I8 


I 




I 


Vkl-19 


33 




I 


Vkl-20 


1 




i 


Vkl-21 


I 




I 


Vkl-22 


0 




1 


VkI-23 


0 


119 entries 


2 


Vk2-I 


0 




2 


Vk2-2 


1 




2 


Vk2-3 


0 




2 


Vk2-4 


0 




2 


Vk2-5 


0 




2 


Vk2-6 


16 




2 


Vk2-7 


0 




2 


Vk2-8 


0 




2 


Vk2-9 


1 




2 


Vk2-10 


0 




2 


Vk2-il 


7 




2 


Vk2-I2 


0 


25 entries 


3 


Vk3-I 


1 




3 


Vk3-2 


0 
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Family* 


Name 


Rearranged^ 


Sum 


J 


Vlf3-3 


35 




J 


Vk3-4 


115 




'I 
J 




0 




q 

J 


Vk3-6 


0 




3 


Vk3-7 


1 




3 


Vk3-8 


40 


192 entries 


4 


Vk4-1 


33 


33 entries 


5 


Vk5-1 


1 


1 entry 


6 


Vk6-1 


0 




6 


Vk6-2 


0 


0 entries 


7 


Vk7-1 


0 


0 entries 



s><r 
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Table 3B: assignment of rearranged V lambda sequences to their germline counterparts 



Family' 


Name 


Rearranged' 


Sum 


1 


DPLl 


1 




1 


DPL2 


14 




1 


DPL3 


6 




1 


DPL4 


1 




1 


HUMLV117 


4 




1 


DPL5 


13 




1 


DPL6 


0 




1 


DPL7 


0 




1 


DPL8 


3 




1 


DPL9 


0 


42 entries 


2 


DPL10 


5 




2 


VLAMBOA 2.1 


0 




2 


DPL11 


23 




2 


DPLl 2 


15 




2 


DPLl 3 


0 




2 


DPLl 4 


0 


43 entries 


3 


DPLIG 


10 




3 


DPL23 


19 




3 


Humlv318 


9 


Jo znWzS 


7 


DPLl 8 


1 




7 


DPL19 


0 


1 entries 


8 


0PL21 


2 




8 


HUMLV801 


6 


8 entries 


9 


DPL22 


0 


0 entries 


unassigned 


DPL24 


0 


0 entries 


10 


gVLX-4.4 


0 


0 entries 
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Table 3C: assignment of rearranged V heavy chain sequences to their germiine counterparts 



Family' Name Rearranged^ Sum 



1 


VH1-12-1 


38 


1 


VH 1-1 2-8 


2 


1 


VH1-12-2 


2 


1 


VH1-12-9 


2 


] 


VHl-12-3 


0 


1 


VH 1-1 2-4 


0 


1 


VHl-12-5 


3 


1 


VHl-12-6 


0 


1 


VHl-12-7 


23 


1 


VHl-13-1 


1 


1 


VHl-13-2 


1 


1 


VH1-13-3 


0 


1 


VH1-13-4 


0 


1 


VH1-13-5 


0 


1 


VHl-13-6 


17 


1 


VH 1-1 3-7 


0 


1 


VH1-13-8 


3 


1 


VH 1-1 3-9 


0 


1 


VH1-13-10 


0 


1 


VHl-13-1 1 


0 


1 


VH1-13-12 


10 


1 


VH1-13-13 


0 


1 


VH1-13-14 


0 


1 


VH1-13-15 


4 


1 


VH1-13-1G 


2 


1 


VH1-13-17 


0 


1 


VH1-13-18 


1 


1 


VH1-13-19 


0 




VH1-1X-1 


1 


2 


VH2-21-1 


0 


2 


VH2-31-1 


0 


2 


VH2-31-2 


1 


2 


VH2-31-3 


1 


2 


VH2-31-4 


0 


2 


VH2-31-5 


2 


2 


VH2-31-6 


0 


2 


VH2-31-7 


0 



110 entries 
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Family' 


Name 


Rearranged^ 


Sum 


2 


\/LJO TI.IA 

Vn^-j 1 - 1**- 


1 




2 


vnz-o 1 "0 


0 




2 


vn/-j 1 "3 


0 




2 




n 




2 




1 

1 




2 


Vnz-0 1- iz 


n 




2 


v/UO 11 11 


1 
1 


7 pnfr/P5 


3 


VH3-n-l 


0 




3 


VH3-M-2 


U 




3 


VH3-1 1-3 


r 

b 




3 


VH3-1 1-4 


n 
U 




3 


VH3-1 1-5 


1 

1 




3 


VH3-1 i-b 


1 

\ 




3 


VH3-1 1-7 


U 




3 


Vrl3-1 1-8 


r 
D 




3 


\ /111 1 *5 1 

VH3-13-1 


Q 




3 


\ /LI*! 11 O 

Vn3-lo-2 


1 
J 




3 


VH3-13-3 


n 
U 




3 


I ILA 111 A 

Vn3-13-4 


n 
U 




3 


VH3-13-D 


n 
U 




3 


VH3-13-6 


U 




3 


Vn3-1 J-7 


11 
JZ 




3 


\ /U 1 1 1 o 


A 
H 




3 


\/lJ1 11 Q 

VnJ-13-y 






3 


\/ui 11 in 
Vn3- 10- lU 






3 


\/U1 11 11 


A 
V 




3 


\/LI1 11 11 

VnJ- lo- IZ 
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Family' Name Rearranged' Sum 



4 VH4-21-5 1 

4 VH4-21-6 1 

4 VH4-21-7 0 

4 VH4-21-8 0 

4 VH4-21-9 0 

4 VH4-31-1 0 

4 VH4-31-2 0 

4 VH4-31-3 0 

4 VH4-31-4 2 

4 VH4-31-5 0 

4 VH4-31-6 0 

4 VH4-31-7 0 

4 VH4-31-8 0 

4 VH4-31-9 0 

4 VH4-31-10 0 

4 VH4-31-n 0 

4 VH4-31-12 4 

4 VH4-31-13 7 

4 VH4-31-14 0 

4 VH4-31-15 0 

4 VH4-31-16 0 

4 VH4-31-17 0 

4 VH4-31-18 0 

4 VH4-31-19 0 

4 VH4-31-20 0 57 entries 

5 VH5-12-1 82 
5 VH5-12-2 1 
5 VH5-12-3 0 

5 VHS-12-4 14 97 entries 

VH6-35-1 74 74 entries 
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1 i i li i 




1 


: : i 
: ; : 
: • : 




R 






i i i l| 


1 1 3! 2! 11 








: : 




571 


S 






21 


351 i 


51 


1 


52i 25! 51 9 


1 






I : 

441 \ 


li 


T 


53 




29i 


_„L L 






i 2i li 








31 1 






V 






1 55 






1 


: 1 ; 






i 1 


31 




W 








i i 


y™ 

1 


\ \ li 




2| 56 


i57i 






X 






i 










1 \ \ 












Y 










: j ; : 
; ; t : 


48 


52! i 








Z 






i i i 


1 i 1 1 i 1 
i i i 1 i 1 




i j 


! 


















\ \ 451 39 












unknown (?) 






: 


i 


1 1 


1 i i 
i i i 






: 
\ 








not sequenced 




4 


2= 2 


2! 2 


2! 2I 1 




1 


1 


li 








sum of seq' 


53 


53 


i 55 


55 


55! 55 


55i 551 56 


561 561 56i 56 


56 


i 56 


561 57 


57I 


57 


1 57! 


oomcaa' 
mcaa' 

rel. oomcaa* 

pos occupied' 


: 53 

: T 


53 


:29 

........ 


^55 


35I 53 

"s"f"G" 


53 

"g 

: CD 

: cn 


i 5li 32 


I 521 251 45! 39 


; 48 


1 52 


561 44 




51 
1 


57| 


j S 1 1 


i s i s i - i - 


Y 


Y 


Wj S 


w 


• o 

; O 


■ o 
• o 


: O" 
: (*1 
: m 


: O 


#1 # 

: CD 
. CD : CTI 


93% 
57% 


II:: 

i ^ I ^ i ^ i ^ 

: CO : tn : O • O 
: 0>: I CO i 


': 0* 
: CD 
: CO 


i ^ 

; 0* 
: CO 

: a> 


• 0 5* 

: O: 1^ 
: r- ; 


■ 0 
0 


CD 
00 


: 0* 
: 0 
: 0 


1 


1 


[ s 


1 1 


i 3I 3 


1 3! 3| 4 


\ 31 7i ei 6 

1. 


1 7 


\ 4 


1 li 5 


1 


5 


1 1 



7^ 
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Framework II 



amino acid' 



C71 O CM n 

CO ^ ^ ^ 



inc£)r^coa>o — <N<<roo 



CO ^ in 
lo in in 



A 

n 
D 




1 ^ 


L-i. 1 _ 








1 












; L 




1 i 1 


1 i 1 








: 

i 














c 

D 


\ \ \ 
1 1 1.. « 


i-U- 








: 
: 
r 






: 
: 




1 j 




1 i 












l| 1 




1| 1 


E 




5d 






j 22 


1 1 1 




: : 

i 1 


F 


1 1 1 1 i 1 

— X — i. — i — j i__ i. 










1 


, 1 iM 








G 


i MM issi 








56 


1 


1 i 1 




l| i 57 


H_ _ 
1 


i 21 i 1 i j 






1 
I 




i i 1 


1 24i 1 \ 






i i i 






54 




1 




• \ \ \ 


K 




: 


1 54| 1 








j 


i : : 
: : : 
t ; : 


: : : i 
\ % I \ 


L 1 




i i iss 






2 






j : 




1 1 : 

M f i i 


^ J 

: ; : 
! ! : 
i \ : 








j 






1 


j 


: i 


N 


: : 
1 : 


: : i 
: : 1 








f 




21 




- ■ 




p 


1 501 49 


i i 1 2 




















Q 


56| 1 j i i i 


1 




! 1 1 






j 


: : : i 

i ; ! ^ 
L A 1 ,1.-,...,; 


R 












9 


1 l| 1 






S 




3M M 1 






I 1 1- 


\ : : 

i 11 i 


1 |52 




X 

V 




i 1 i i 










:' i : 
1 : \ 

.L. ...... .i....^.. ■'. 


i 8 


5 


i 








1 




: ! : 

3| i \ 










w 


1 i 1 i i i 




56 






\ \ \ 










X 












: j : 










Y 


\ ,\ j i i 1 




1 


! Il5 


132 






23| 




z 


: : : t : : 
:::::: 






i i 














I 1 : 


: ! 




i i i 


1 


571 57 


57 






unknown (?) 


1 1 1 - 
... .i 1 ^.i — 


. 1 I 




i i i 

i : : 
• i i 


; 

: 


; 








not sequenced 


i i i 






i ; 1 


: 


: 








sum of seq' 


571 57| 571 57 


571 57i 57! 57 


571 57! 57! 57 


571 57i 57i 57 


57 


57| 57 


57: 


oomcaa' 
mcaa' 

rel. oomcaa" 

pos occupied" 


56 

"q" 


50] 49; 55 

"pT'pTg" 


lM[55J.55ij56 

'FrGl l1"e" 


56j 54! SSi 22 

'wr'TTGTT' 


541 32I 57j 57 


57 


24i 52 

"hI's" 


57 

"g" 


00 

o> 


O" : : O" 

CO • CO ; CO 

CO : CO : C> 


: : O" 
in : CO : CD 


CO 


^ i ^ i ^\ ^ 

& \ a* : : 
CO : in i 00 : o> 


^ i ^ \ ^1 ^ 

: 0* 1 0 : 0 
in : CD I 0 : 0 
CT> : m : »— : 


# 
0 
0 


#! # 

CM 1 ^ 
\ (7) 


# 
0 
0 


2 


5| 21 3 


2i 2! 2 


2 


21 3! 2! 8 


21 Gi ll 1 


1 


5i 2 


1 
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CDRII 



amino acid' 


CD 

in m 


00 

in 


in 


0 

CD 


CD 


CN 
CD 


CO 


CO 


in 

CD 


CD 
CD 


CD 


00 

CD 


at 

CO 


0 




CM 


CO 


^ in 


A 


1 ^ ^ 






i 


1 

t«.« I— 






1 




1 




1 1 






1 li 

; : : 


__B 


i 








: 














1 






j : 


■ C 










1 














i 

: 






: : 


D 




2 
















1 










55 






E 








1 






















1 




: : 


F 






3 


: 
: 






















1 1 




G 


ij 
















1 














: 
I 


: ; 


H 




1 2 
































1 


l| 1 


















1 


1 


48 




3 






K 








1 


\ \ 




S3 








1 1 




i 1 11 


i 51! 


L 








\ li 


55 








1 






3 




i ij 


M 






i 
















7 


... . 




i 21 




N 


2\ 


40 


i 53 
















2 












1 1! 


P 








1 54 




1 




















— — 




Q 


: 

! 




i 1 
























: : : 
: : : 


R 


2i 




: i t 




3 


i 


56 














i i 2! 


S 




1 




2 


I56 






sei 






1 




! 

56! 






1! 


57! 1 





li 54 


ll 




1 






1 








51 




1 






521 1 i 


V 




— r ; 
















53 




2 




50; 








W 

- — — 














■ — . "T 

: 1 
: 1 




















X 


— T " 
















1 






t : 
t : 








Y 






54 












1 i 1 

; t 




1 




1 






Z 


: 1 : 

i 1 1 




i 1 




1 1 1 




i 


i 
1 










i I i 




! 1 

: 1 i 




: : I 
i : : 












unknown (?) 


III 




I \ \ 


: : : 

: I 1 
: : ! 








: 






not sequenced 


i i i 


1 


ii 


ll 


li 




i i li 1 








: 






sum of seq' 


571 571 


57i 57I 


56! 




56i 56j 


571 


57I 571 


561 561 


571 5?! 


57 1 




57! 


57i 57! 


oomcaa' 


49! 541 401 541 


53! 


54i 


56j SSj 


53 i 


56| 56i 53i 5ll 


481 


56l 50i 


551 52I 57| 51! 


mcaa* 


s i T i 


N 1 


Y 1 


N 


p 1 


s 1 


L i 


K i 


s i 


R i 


Vi 


T i 


1 1 


...... 

s i 




D I 


tT 


"s i K 1 


rel. oomcaa* 1 




#1 

0 : 


0* : 

j 
i 


i 

0* : 
tn : 
cn : 


CD • 
CD : 


0 • 

0 j 


#i 

O) • 


#1 

CO : 
<J> : 


^ \ 
0* : 
00 : 
CT> : 


#i 

00 ; 

CD : 


#i 

cn • 


# 1 
CT) i 


^ ■ 

o" 1 

i 

00 ; 


i 

0* : 
00 : 

cn : 


#i 

00 : 
00 : 


0" : 
CD : 


^ i 

O" j 


#! 

0 ; cn • 

[ CO : 


pos occupied' [ 71 4| 


6| 


21 


3j 


3I 

L 


i! 


2I 


3l 


2I 


2I 


4I 


5j 


3! 


2I 4| 


j\ 


si 


l| 6; 
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amino acid' 




1^ 


CO 




o — 


CM 
00 


< 


CO 




cn 

00 


00 


in 
00 


CD 

00 


00 


00 
00 


00 


0 

CD 




CN 


A 






















55 


57 


r 




571 






B 






































■ C 


































• 1 


0/ : 


D 










1| 
















Df 












E 










1 1 
























1 




F 


1 |54 










1 

■ 






















6 










1 




















: 
i 




_H_ 




1 i 

i j 














: I 
' 1 












: : 

: 1 




1 




1 








1 

1 


i 


1 3 








i 


^^....-A 




K 1 3| 1 




1 461 






: 
I 
I 








1 


i I 
\ \ 




L 




31 


1 




551 


53 






2 














\ ll 1 




IVI. ill 




i 1 


1 






1 


















N 










i 3 




3 


1 
















ill 




P 


i 
































: 1 




Q 


IH • 




li 1 
























> — ♦ ■ ■•——»-**-• 't 

\ 1 1 


R 


......-L1 ... 




i 2 




2 








1 
















S 




1 1 


57 


i 2 


1 


44 


55 




1 








2 






1 1 




T 




j 




j 1 




4 






53; 








55! i 








V 










2 






54 




1 










551 i 




W 












1 














: : 
: : 




X 










t 

I 












: I i i 1 


Y 


: 
t 




















1 \ S7l 561 i 


Z 


1 i 


t : : 
I : : 








i i 




1 : 1 I [ : 
1 1 1 : : i 




















— i — 1 — 




iinicnown (?) 




: 1 : j : 








! 












not sequenced 


i 1 1 


1 : i 1 
















: 




sum of seq' 


571 


571 


57i 57\ 


571 571 


571 


57 i 


57| 


57 


571 


571 571 


57! 


57i 


57 1 


57 


571 


57! 


57! 


oomcaa' | 


54j 541 54! 57i 


551 46i 53i 44 


551 


54i 


53i 


55= 57I 


57I 


55I 


57j 


55; 


571 


56! 57: 


mcaa' 


N i 


Qi 


F 1 


5 1 


L 1 K 1 


L 1 


s i 


s 1 


V 


T 1 


A i 


Al 


D i 


T 1 


A 1 


V i 


Y i 


Y 1 


c 1 


rel. oomcaa' | 


: 

in : 


^ i 

O" : 

in : 
GP>| 


: 

in : 


1100% 


#i #1 

CD ; — i 


o* i 
ro : 
cr> : 


: 

• 


^ i 

o* : 
to : 
<T> i 


#1 

in i 

CD : 


#1 

CO : 


O" ; 
CO : 
CT> • 


100% J 


1100% 1 


-2 i 
0* : 

CO ; 

: 


100% 


#1 

CD i 

cn i 


100% 


-5 
0* : 
00 : 
CD : 


0* : 
0 : 


pos occupied" i 


2i 


2\ 


4i 


li 


3i 8i 


4| 


A. 


31 


31 


3\ 


t 

3| 




li 


21 


1: 


3i 


li 


2i 


ll 
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CDR III 



amino acid' SS^SS;SS2<«^<^Q"^^^=^ '^2 



A i56 


i 3! 3 


3l 2 


5I 4 


2I 2 


4 




2 


ll 1 ll 1 


121 


j 


B 1 


i 1 


1 


— j — 


i 

i 








1 j 1 

I i i 


: : j 


C 1 


: : 

1 ! ' 


i! 


: 
I 
: 
I 










1 1 i 


: i I 
: : : 


D 








51 5 


51 4 


3i 2 


4 


3 


1 


1 

1 1 


21 


li i i41 


E 






ei 1 


li 21 li 


7 i 

1 1 


3 


1 


2 


V : : i i i 
1| i 1 1 j j j 


F . 






i 4 


l| li 12 


si 2 


2 




1 


l| 


1 i hi! 1 


G 






251 9 


loj si 10| 11 


4i 7 


7 


6 


1 


11 1 


2I l| 9 


: 1 


H 






li 


i i li 


1 

: 


i 


1 


i 


li i 1 




_„2| 


1 






1 1 


i 21 4i 1 


3i 2 


3 




1 


i 


: I 


1 


K 






21 1 


1 1 1 


i 2 


2 






ij 


1 i 




L 






2j 6 


71 3| 5i 3 


21 4 


1 


5 


3 


3! 


li i 




M 




i 1 


4! 1 3I 1 


i 2 


1 












si i 


N 


1 


i 3 


I i 1 
i 1 i 


2i 1 


1 


5 


1 


1 




i 2i i 




P 






i 4 


51 31 ij 1 


2i 1 


1 


1 


2 


3 


1 


2! li 




li 


Q 




! i 
! 1 


li li 1 1 


i 

1 


li 1 




1 3 


j i 






54| 4| 12 


2I 51 5I 3 


2! 3 


li 2 




. 1 iLil. i J 


1 1 


Z±l 

T 




li 11 4 




i ! 

81 11 2 


5i 7 


4i 2 


1 


ll li ! i 










ll l| 2 


1 


3l 4I 4 


31 3 


i 


1 


li li 1 i 

...! X i 








V 


1 


ll 41 2 


2 


51 41 4 


7i 3^ 


li 2 


1 








W 




1 li 2 


1 


21 2i 4 


4 

sj 1 


li 2 




2i ll 1 3! 


2 






X 


: : t 

] \ i 






1 






ill 








Y 




1 


4 


5! 3i G 


' — 1 — ' 

! 4| 2 


31 4 


8 


41 81 3i 5 


8 




2! 


z 1 1 


1 




! 1 i i 


1 

! 




: : : 

1 i ! 


















li 2I 4i 6! 9 


11 


16 


23 


271 29 


34 


31 


14 


4 




unknown (?) 
not sequenced 












i 1 i 1 


: : 

i i 




li 




1 


1 


1 








1 


1 


1 


li li 2i 3| 3 


i 6i 7 


1 8 


; gi 9 


10 


n 


11 


1 11 




sum of seq' 


57 


! 57 


1 56 


56 


. 56 


561 561 55! 54! 54 


i 5li 50 


i 49 


1 481 48 


47 


46 


: 46 


1 46 


46' 


oomcaa' 
mcaa' 

rel. oomcaa' 

pos occupied* 


56 
A 


[54 


: 25 

Fg" 


12 
....... 


: 10 

[g" 


j^ io] n 

rGT'Gl'G 


1 7I 9 


lli 16 


i 23 


i 271 29 


1 34 


31 


14 


hi 


41 


1 V 1 - 














i F 


i D 


: O' 
: CO 
: C7> 


1 # 

i ^ 


! ^ 

■ o" 
: in 
: "ll- 


. CN 


; ^ 
; 0 
: 00 


ill: 
^ : ^1 ^1 ^ i ^ 
1 0* : O" : : ■ O" 

; ^ ! 00 : 0 : n : 

: t- : 17- j CN : : — 


i j 

1 #i # 

: <N i CM 
! CN ! CO 


1 ^ 

: 


i #i # 

:<£> ': 0 
; in : CD 


1 ^ 
: O' 
: CN 
: 1^ 


: 

: 0 


1 # 
• 0 
: ro 


i ^ 

: 0" 
: ^ 
: to 


1 ^ 
: O" 

: cn 

• CO 


i 2 


\ 4 


12 


i 16 


i 16 


\ 16! lei 16! lel 18 


i 18 


i 13 


i 15 


1 13! 10 


1 9 


! 

i 8 


5 


\ 4 


: 4 
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amino acid' 



OOOOOOOO — '-'-'^ 



A 








: 


li 




1 


I 




B 








: 

i 
t 


1 








I 
I 




C 








I 

< i 










1 




D 








f 
j 


I 
1 






1 




E 








i 
i 


j 






i 
i 




F 








1 

i 


1 








1 

f 




G 






41 


— 1 — 

140 


ij 












H 


1 








1 




1 








1 


9 






i 

: 














K 








: 

3| 


i 
r 












L 
M 


4 






I 

i 

i 

i 


ij!9 

1 9 












N 








i 


li \ 












P 


3 


















2 


Q 








29| 














R 


1 


i 


4i 


i li 








S 


1 


^ ™, 

j 


11 


i I 




i 36 


33 


T 




i 

t : 
: 


i 

1! 


331 sl 


34 






V 


12 


{• 

t 


— 1 — . 

! 


1 136 




36l 




W 




46! 


i 1 1 1 1 

1 1 1 i i 






X 




i 


: t : ; : 
I : : : : 
: I { • : 






Y 

Z 




1 

i i 


i 

""T" 


1 i 1 

1 i 1 


j 


— 


unknown (?) 




1 


: 

: : 


J L 

: : : 
: i I 




-.VMM. 


— 


not sequenced 


101 111 lei 171 17 


20i 20i 21 


21 


21 


21 


22 


sum of seq' 
oomcaa' 
mcaa' 

rel. oomcaa^ 

pes occupied'^ ; 


47 


46| 41 i 40i 40! 


371 37i 36 


36i 36 


36 


351 


16 


46i 41 1 291 40 


33l 191 36* 


34| 36 


36 


33i 


Y I 

^ : 
CO : 


Wi G 1 Qi G 
•s i -e 5 1 ^ 

& • & ; ^ : • 
O: O: S-: O: 
O • O 1 CO j O 


T 1 L i V 

i 1 ^ 

^ : ^ ■ £ : 
& : & ] O . 
Cn : : O : 
00 : in : ^ : 


T i V 

&• o 
'^l o 
O): 


S: 

^ ■ 

O : 
O 


s 1 

^ ! 

O* : 
^ : 
O) : 


8 


li 11 61 1 


51 4! 1: 


31 1 


1 


21 



sum 
332 

113 
210 
176 
135 
674 

45 
282 
278 
540 

43 
204 
281 
334 
250 
986 
532 
488 
267 

455 
1 

466 
4 
426 
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amino acid 



A 




1 — 1 

! 


1 






1 


89 




1 


_iL'. 




1 


: 1 t 


B 


...... ^ 






















i i 




1 




• C 


— 












1 


















: 
: 


I I : 


D 




















2 












j 


E 


88 


1 






2 








4 


93 












92I 




F . 
































1 1; 


i l-i 


G 


1 












! 92 












94 


; 
: 






H 




































l_ 
K 


















j 






























i 




94 


94 














L 

M 




1 




91 




2 




















95' 


















3 








j 








l| i 


N 
















j 1 


















P 






1 1 










! i 

11 i 


: 
: 




94 






\ 

: 




Q 


. 3 




92 




1 1 


90 






i 


: 








3! 






ii_ j 


R 












1 








1 1 


1 




1 








i7i_j 


5 














92! 


i 












94 




T 














U»M~W>— •••"1 

; j 




















V 




90 






; 89 








lj 


91 
















W 




































X 
Y 










































I 






















Z 














i 

i 






















; : I 




\ : 
: : 




i 






















unknown I?) 






























not sequenced 


1 51 5j si 51 4 


\ 4 4 


! 4 


2j 


2 


1 2 


1 2 


2 


: 2 


1 2 


1 2 


: 2 


1 2 




sum of seq' 


1 92; 92i 921 92I 93 


1 931 93 


1 93 


i 95^ 


95 


i 95 


i 95 


; 95 


i 95 


1 95 


j 95l_95 
j 921 94 


1 95ll§Lii?: 
ri 951 771 9^6 
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>: Oj C 


i ^ 

: ^ 
>: C 

M ^ 


: ^ j -S 
: O* ; O' 
> : CT> : C 
- : O : O" 


i # 

>: C 


1 ^ 

: 5" 

>: cr 


1 # 

: 

\\ a 


^ 1 
1 c 


: 

: : -P : 
: : 0 : 0 
: : a> : CT> 
: CI : 00 : CD 


pos occupied" i 3! 


li : 


?i : 


2! : 


21 


1^ 


Ij 




l| 


Ij 




i 

2i : 


?! : 


2I : 


31 : 


3! 


li 


3i si 2 
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CDR II 



amino acid' ui m m m to cd co to CO to to 



_B 

c 



D 
E 



68 



i 1 



V! 



Jl 
M 



i 1 



Q 



731 1! 



651 1 



W 
X 



11 



601 1 



unknown (?) 



not sequenced 



58! 



73 



67 



8i 



2! 



65 



61 



73 



11 



2i 71 



i 1 



173 



66! 



72 



72 



11 



i 69! 



I 1 



I 69 



4i 



i 66 



2i 1 



sumofseq' 1741 741 74 



oomcaa' i.60|65[68 



mcaa 



i Y i N i D 



rel. oomcaa* 1 § j ^ j ^ 



CO i 00 : 0> 



pos occupied" I 7| 6|_ 5 



741 741 74 



72 



731 58 



Al V 



74' 741 74 



73 

T 



r» • oj • 00 



3i 2i 7 



721 67 



V i K 



1 1 



70i 



73 



71! 1 



2! 



74 



66 



741 741 74] 74 



731 651 69j 71 

■'R'rTT'T'TT 



3- i 3- ; & : o 

C7> • 00 i CO ■ to 

CD ; CO ■ ff> ! CT> 



2! 4! 41 



74i 74 741 74! 74[ 74; 



691 661 73i 71] 731 70i 



N! P i D i T i S i K 



^ ii f 



4i 4i 2! 4! 21 3 
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Franriework III 



aminoacid' SE:£SSSS<»" S 5 S S S g S S 5 Si 



A 1 i 1 r 


i 

i ; 














1 
1 




: 


74! 


„ 1 : :■■ 




i • * 

B 1 1 




— i — - 

i 




















j 






C 1 






























1 




731 


D 
















3 












/J 










E 






i 




















73 














F 






7ii 




























3| 




G 
























1 












H 










2 




1 


: 
; 














[ i — i- 




1 




i 1^ 


























1 2 




K 




: 
\ 










4 




















— i 

— i 


L 




ll 




74 




72 
















M 












1 






„.ii..._ 












N 


74 














63 












i i i 1 ii 




P 
















> 1 

1 


\ i_70 











— 1 






72 


1 




71 








j : 






i i i 1 i 

I '• '• ' i 






R 


r ' 
\ 1 








1 




1 




1 1 










i I 


ij 


S 


I 

I 




74 








1 


73 


1 l| 3 










T 












i 


1 i 1 




1 73 








74 


1 1 1 ll 




V 






2 














731 








i 1 70! i 




W 
































X 


t 


1 
























Y 


: 








! 












: : 

! 
t 


\ \ i 73j 70 


Z 


i 
i 














: : 
I : 








































unknown (?) 


i 












j j 














not sequencec 


i 






1 


: i 


: j 




1 1 








sum of seq' 
oomcaa^ 
mcaa* 

rel. oomcaa'^ 

pos occupied* 


i 741 74 


^l 74 


\\ 7a\ 741 741 71 


^i 741 7^ 


^{ 741 74; 7: 


1 741 74! 741 74! 74! 74| 7A 


74 


1 741 72 


! 71 


1 74i 74| 7li 72 


•i 63! 7: 


il 73| 73i 7( 


)! 7: 


731 74| 74I 70| 73| 7C 


)i73 


i N i Q 


F 


i S i L 1 Q| L 


i Nj S 


i V i T j p 


1 E 


D j T i A| Vi Y i Y 


i c 


i ^1 ^ 

: O: &• 

• O: r> 

• <-• C 


>i a 


ui^i i 

J o" : o" : ^ : ^ 
: O : O : O" • O" 
: O : O : <^ : 
» I I : CD : Cr 


i #1 i 

. : ID : a 
) : 00 : a 


) ■ ^ : ^ j -€ 
■ ; O" i O" : 0 
) : O) : O) 1 C£ 
> : 0 t ^ J...,? 


) ! 0 

> : 0 


£^ ' 

> ; cn : 0 : 0 L 


P - - 
5" : c 

n\ C 
31 


0 ': ^ 
: 0 

T>: cr 


> i -P 
: O" 
) : 0 

>; O) 


M ii : 


jI : 


31 li Ij 31 : 




21 2I 


21 _ 


zL 


2| 2I li 


2i 


31 2 
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amino acid' cn oi o a> g> g> <^ ^ 



o X — 




| not sequenced 
sum of seq' 
oomcaa^ 
mcaa* 

rel. oomcaa'' 

pos occupied' 



74] 74| 73| 72171171 
69 



691 I9l lOi 15i 15 



Rl Di Pi 6i G 



O I O ! ^ : ^ i -S 
3- i 5- : Si ° '■ " 

cn i rM : '-I <N l,S^. 



41 141 20i 19i 15 



72 



14 



17 



721 72| 72! 72Li^[^2i 72] 72] 72|2 



72i 



'Slsi' 3? 4 if 47I 531 54I 57! 5Gi 50i28[ 38L62: 

' • ■ ' ^ ' i F i D 



-si -si #i #i #i #i #! #i #i ^ 

: i\ UMiimMimMm- 



\q\ 16] 13= dnj 81 8l 4I 51 7| §LJLJ 
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Framework IV 



o 



in (£) 00 
O O O o 



o o 
o 



S:! ^2 sum 



A 










1 : 


2 




i 






B 










i 














C 
























D 










: 














E 










; 
: 














F 


2i 








I 










21 




6 






49 




















H 


2\ 






















1 


9 














il 










K 








V 


















L 


5 






















M 




i ^1 








" i 












N 


























P 


4 






6 
















1 


Q 






1 40 


















R 








2 


















S 


4 




1 






1 










43 


Ue 


T ' 












45 






45 








V 


21 




i 

: : 








i 46 




i 48! 




W 




•esl 1 
















X 






; 


1 
















Y 


' 19 


















■ 1 i 


Z 








1 

I 




1 






• i 1 


- 


2 






: 
















unknown (?) 








\ 












i 




not sequencec 






li 2: 


1 24123 


i 24 


\ 11 


ij 2£ 




p 25i 2Bi 26 


sum of seq' 




1! 6e 


)i 5( 


)l 49I 5C 




)i 4£ 


j| 4J 


n 4£ 


il 48l 451 47] 


oomcaa^ 


1 21 




i! 45 


)| 401 5C 


li 4: 


il 2( 


ji 4( 


jj 4; 


ij 48| 43| 46j 


mcaa* 


1 V 


j w 


i ° 


1 Q| G 


1 T 


i L 


1 V 


i T 


1 vj s] si 


rel. oomcaa^ 


1 1 


U S 

■ ; c 


>i # 

) : OC 

• i a 


^ i ^ 

> i ^ 1 g 

s-i a 

> : CN : C 
^: 00: *- 


• 

: & 
: CN 

: a 


) ; ■<J 

) : 


>i ^ 
•>■. a 


H 

>: C 

> I 


> = -p I i 

* : 0* : ^ : ^ 
) : 0 : O" : O* 

> : 0 : to : 00 


DOS occuoied 




9i 


li 


2I 4| 


l| 


3i 




3I 


l| 


li 


2i 2 



494 

147 
403 
186 
150 
571 

18 
304 
293 
632 

31 
436 
387 
539 
495 
1271 
640 
647 
398 

518 

585 
13 
580 
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Appendix to Tables lA-C 

A. References of rearranged sequences 

Rffprpr^ces of renrrar\aed hum nn knppa seauenres »sfrf for alignment 

1 . Alescio-Zonta. L ft Baglioni, C (1970) EurJ.Biochem.. 1 5. 450-463. 

2 Andrews. D.W. & Capra. J.D. (1981) Biochemistry. 20. 581G-5822. 

3 Andris. J.S.. Ehriich. P.H.. Ostberg. L ft Capra. J.D. (1992) J.Immunol.. 149. 4053-4059. 

4 Atkinson. P.M., Lampman. G.W.. Furie. B.C. Naparstek^Y.. Schwartz. R.S.. Stollar. B.D. £t 
Furie. B. (1985) J.CIin.lnvest., 75. 1138-1143. 

5 Aucouturier. P.. Bauwens. M, Khamlichi. AA.. Denoroy. U Spinelii. S.. Touchard. G.. 
Preud'homme. J.-L ft Cogne. M. (1993) J.Immunol.. 150. 3561-3568. 

6 Avila, M.A.. Vazques. J., Danielsson. L. Fernandez De Cossio. M.E. ft Borrebaeck. C.A.K. 
(1993) Gene. 127.273-274. 

7 Barbas lii. C.F.. Crowe. Jr.. J.E.. Cababa. D.. Jones, T.M.. Zebedee. S.L.. Murphy. B.R.. 
Chanock. R.M. ft Burton. D.R. (1992) Proc.Natl.Acad.Sci.Usa. 89. 10164-10168. 

8 Barbas. C.F.. lii. et al. (1993) J-Mol-Biol.. 230. 812-23. 

9 Bentley. D.L Et Rabbitts. T.H. (1980) Nature. 288. 730-733. 

10 Bentley. D.L ft Rabbitts. T.H. (1983) Cell. 32. 181-189. 

1 1 Bentley. D.L (1984) Nature. 307, 77-80. 

12 Bhat, N.M.. Bieber. M.M.. Chapman. CJ.. Stevenson. F.K. ft Teng. N.N.H. (1993) J.lmmunol.. 
151.5011-5021. 

13 Blaison. G.. Kuntz. J.-L 6t Pasquali. J.-L (1991) EurJ.lmmunol.. 21. 1221-1227. 

14 Braun. H.. Leibold. W.. Barnikol. H.U. ft Hilschmann. N. (1971) Z.Physiol.Chem.. 352, 647- 
651; (1972) LPhysioLChem. 353. 1284-1305. 

15 Capra, J.D. ft Kehoe. J.M. (1975) Adv.lmmunology. 20. 1-40. ; Andrews. D.W. Et Capra. J.D. 
(1981) Proc.Nat.Acad.Sci.Usa. 78, 3799-3803. 

1 6 Capra, J.D. £t Kehoe. J.M. (1975) Adv.lmmunology. 20, l -40. ; Ledford. D.K.. Goni. F.. 
Pizzolato. M.. Franklin. E.C.. Solomon. A. Et Frangione. B. (1983) J.Immunol.. 131. 1322- 
1325. 

17 Chastagner.P..Theze.J.aZouali.M.(l99i)Gene. 101.305-306. 
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18 Chen. p.p.. Robbins, D.L. Jirik. F.R.. Kipps. TJ. a Carson. D.A. (1987) J.Exp.Med, 1 66, 1900- 
1905. 

19 Chen, P.P.. Robbins. D.L. Jirik. F.R.. Kipps. T.J. ft Carson. D.A. (1987) J.Exp.iVled, 1 66, 1900- 
1905: Liu, M.-F., Robbins. D.L. Crowley. J J.. Sinha, S., Kozin. F., Kipps. TJ., Carson. D.A. ft 
Chen.P.P. (1989) J.Immunol., 142, 688-694. 

20 Chersi, A. ft Natali, P.G. (1978) Immunochemistry. 15, 585-589. 

21 Co, M.S.. Deschamps, M.. Whitley. R.J. ft Queen. C. (1991) Proc.Natl.Acad.Sci.Usa. 88, 
2869-2873. 

22 Cuisinier, A.-M., Fumoux. F., Fougereau. M. Et Tonnelle. C (1992) Mol.lmmunol., 29, 1363- 
1373. 

23 Davidson. A.. Manheimer-Lory, A., Aranow, C, Peterson, R.. Hannigan. N. ft Diamond, B. 
(1990) J.Ciin.lnvest.. 85, 1401-1409. 

24 Denomme, GA, Mahmoudi, M., Edwards, J.Y., Massicotte, H., Cairns, E. ft Bell. D.A. (1993) 
Hum.Antibod.Hybridomas. 4, 98-103. 

25 Dersimonian, H., Mcadam, K.P.WJ., Mackworth-Young, C. ft Stollar, B.D. (1 989) 
J.lmmunol., 142,4027-4033. 

26 Dreyer. WJ., Gray, W.R. ft Hood. L (1967) Cold Spring Harbor Symp. Quantitative Biol., 
32. 353-367. 

27 Ebeling, S.B., Schutte, M.E.M. ft Logtenberg, T. (1993) EurJ.lmmunol., 23. 1 405-1408. 

28 Eulitz, M. ft Kley. H.-P, (1977) Immunochem., 14, 289-297. 

29 Eulitz, M. ft Linke, R.P. (1982) Z.PhysioI.Chem.. 363, 1347-1358. 

30 Eulitz, M., Breuer, M., Eblen, A., Weiss, D.T. ft Solomon. A. (1990) In Amyloid And 
Amyloidosis. Eds. J.B.Natvig, O.Forre, G.Husby, A.Husebekk, B.Skogen, K.Sletten ft 
P.Westermark. Kluwer Academic 

31 Eulitz, M., Gotze. D. ft Hilschmann. N. (1972) Z.Physiol.Chem.. 353, 487-491; Eulitz. M. ft 
Hilschmann. N. (1974) Z.Physiol.Chem.. 355, 842-866. 

32 Eulitz, M., Kley. H.P. ft Zeitler, HJ. (1979) Z.Physiol.Chem., 360, 725-734. 

33 Ezaki, I., Kanda, H., Sakai, K., Fukui, N.. Shingu. M., Nobunaga, M. ft Watanabe. T. (1991) 
Arthritis And Rheumatism. 34, 343-350. 

34 Feigenhauer, M., Kohl, J. ft Ruker. F. (1990) NucLAcids Res., 18, 4927. 

35 Ferri, G., Stoppini, M., ladarola. P., Bellotti, V. ft Merlini, G. (1989) Biochim.Biophys.Acta, 
995, 103-108. 
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36 Gillies. S.D.. Dorai. H.. Wesolowski. J.. Majeau. 6.. Young. D., Boyd. J.. Gardner, J. ft James. 
K. (1989) Bio/Tech.. 7. 799-804. 

37 Goni. F. ft Frangione. B. (1 983) Proc.NaUcad.Sci.Usa, 80. 4837-4841 . 

38 Goni, F.R., Chen, P.P.. Mcginnis, D., Arjonilla, M.L., Fernandez. J.. Carson, D., Solomon. A.. 
Mendez, E. ft Frangione, B. (1989) J.lmmunol.. 142, 3158-3163. 

39 Gorman. S.D.. Clark, M.R.. Routledge. E.G.. Cobbold. S.P. ft Waidmann. H. (1991) 
Proc.Natl.Acad.Sci.Usa, 88. 4181-4185. 

40 Gottlieb. P.D.. Cunningham. B.A., Rutishauser. U. ft Edeiman, G.M. (1970) Biochemistry. 9. 
3155-31G1. 

41 Griffiths. A.D.. Malmqvist. M.. Marks. J.D.. Bye. J.M.. Embleton. MJ.. Mccafferty. J., Baier. 
M.. Holliger, K.P., Gorick, B.D.. Hughes-Jones. N.C.. Hoogenboom, H.R. ft Winter. G. (1993) 
Embo J.. 12, 725-734. 

42 Hieter. P.A., Max. E.E., Seidman. J.G.. Maizel. J.V.. Jr. ft Leder, P. (1980) Cell. 22. 197-207; 
Klobeck. H.G, Meindl. A.. Combriato. G.. Solomon. A ft Zachau, H.G. (1985) NucUcds 
Res.. 13, 6499-6513; Weir. L. ft Leder, P. (1986) 

43 Hilschmann. N. ft Craig. LC. (1965) Proc.Nat.Ac3d.Sci.Usa. 53. 1403-1409; Hilschmann. N. 
(1967) Z.Physiol.Chem., 348. 1077-1080. 

44 Hilschmann. N. ft Craig. LC. (1965) Proc.Nat.Ac3d.Sci.Usa. 53, 1403-1409; Hilschmann. N. 
(1967) Z.Physiol.Chem.. 348, 1718-1722; Hilschmann, N. (1969) Naturwissenschaften, 56, 
195-205. 

45 Hirabayashi, Y., Munakata. Y., Sasaki, T. ft Sano, H. (1992) NucLAcids Res., 20, 2601. 

46 Jaenichen, H.-R., Pech, M., Lindenmaier. W.. Wildgruber. N. ft Zachau. H.G. (1 984) 
Nuc. Acids Res.. 12. 5249-5263. 

47 Jirik. F.R.. Sorge. J.. Fong, S., Heitzmann, J.G., Curd, J.G., Chen, P.P., Goldfien, R. ft Carson, 
D.A (1986) Proc.Nat.Acad.Sci.Usa, 83. 2195-2199. 

48 Kaplan. A.P. ft Metzger. H. (1969) Biochemistry. 8. 3944-3951. : Klapper, D.G. ft Capra. 
J.D. (1976) Ann.lmmunol.(lnst.Pasteur), 127c, 261-271. 

49 Kennedy, M.A. (1991) J.Exp.Med., 173, 1033-1036. 

50 Kim, H.S. ft Deutsch, H.F. (1988) Immunol., 64. 573-579. 

51 Kipps. TJ., Tomhave. E., Chen, P.P. ft Carson. D.A. (1988) J.Exp.Med., 167, 840-852. 

52 Kipps, TJ., Tomhave, E., Chen, P.P. ft Fox. R.I. (1989) J.Immunol., 142, 4261-4268. 

53 Klapper DG ftCapra. J.D. (1976) AnnJmmunol.(lnst.Pasteur). 127c, 261-271. 

't 
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54 Klein. U.. Kuppers. R. ft Rajewsky. K. (1 993) EurJ.lmmunol.. 23. 3272-3277. 

55 Klobeck, H.G, Meindl. A.. Combriato. G.. Solomon. A. ft Zachau, H.G. (1985) Nucl^Acids 
Res.. 13. 6499-6513. 

56 Klobeck. H.G.. Bornkammm. 6.W.. Combriato, G., Mocikat. R.. Pohlenz, H.D. ft Zachau. H.G. 
(1985) NucLAcids Res., 13, 6515-6529. 

57 Klobeck. H.G.. Combriato, G. a Zachau, H.G. (1984) NucAcids Res., 12. 6995-7006. 

58 Klobeck, H.G., Solomon. A. ft Zachau. H.G. (1 984) Nature, 309. 73-76. 

59 Knight, G.B. Aghello, V.. Bonagura, V.. Barnes, J.L, Panka, DJ. ft Zhang. Q.-X (1993) 
J.Exp.Med.. 178, 1903-1911. - 

60 Kohler. H., Shimizu. A.. Paul. C. ft Putnam, F.W. (1970) Science, 169. 56-59. (Kaplan, A.P. 
a Metzger. H. (1969) Biochemistry. 8. 3944-3951.) 

61 Kratzin, H., Yang. C.Y., Krusche. J.U. ft Hilschmann. N. (1980) Z.PhysioI.Chem.. 361, 1591- 
1598. 

62 Kunicki. T.J., Annis, D.S., Gorski, J. a Nugent. DJ. (1991) J.Autoimmunity. 4, 433-446. 

63 Larrick. J.W.. Wallace. E.F.. Coloma. MJ.. Bruderer, U.. Lang. A.B. ft Fry. K.E. (1992) 
Immunological Reviews. 130, 69-85. 

64 Laure, CJ., Watanabe, 5. a Hilschmann. N. (1973) Z.PhysioI.Chem., 354. 1503-1504. 

65 Ledford. D.K., Goni. P., Pizzolato, M.. Franklin. E.C., Solomon. A. a Frangione. B. (1983) 
J.Immunol.. 131, 1322-1325. 

66 Ledford, D.K., Goni, P., Pizzolato, M., Franklin, E.C., Solomon, A. a Frangione, B. (1 983) 
J.lmmunoL, 131, 1322-1325. 

67 Ledford, D.K., Goni, F., Pizzolato, M., Franklin, E.C.. Solomon. A. a Frangione, B. (1 983) 
J.lmmunoL, 131, 1322-1325. Pons-Estel, B.. Goni, F.. Solomon, A. a Frangione, B. (1984) 
J.Exp.Med.. 160, 893. 

68 Levy. S.. Mendel. E.. Kon, S.. Avnur, Z. ft Levy. R. (1988) J.Exp.Med.. 168. 475-489. 

69 Liepnieks. J.J.. Dwulet, F.E. a Benson. M.D. (1990) Mol.lmmunol.. 27. 481-485. 

70 Manheimer-Lory, A., Katz. J.B.. Piliinger, M., Ghossein, C. Smith. A. a Diamond, B. (1991) 
J.Exp.Med., 174, 1639-1652. 

71 Mantovani, L, Wilder, R.L. a Casali, P. (1993) J.Immunol., 151, 473-488. 

72 Mariette, X.. Tsapis. A. a Brouet, J.-C. (1993) EurJ.lmmunol.. 23. 846-851. 

73 Marks, J.D., Hoogenboom. H.R.. Bonnert. T.P.. Mccaffcrty, J., Griffiths. A.D. ft Winter, G. 
(l99l)J.Mol.Biol., 222. 581-597. ✓ 

SUBSTITUTE SHEET (RULE 26) 



PCT/EP96/03647 

WO 97/08320 

74 Marsh. P.. Mills. F. ft Gould. H. (1985) Nuc^cids Res.. 13. 6531-6544. 

75 Middaugh. CR. ft Litman. 6.W. (1987) J.Biol.Chem.. 262. 3671-3673. 

76 Miistein, C. ft Deverson. E.V. (1971) BiochemJ.. 123, 945-958. 

77 Miistein. C. (1969) Febs Letters. 2. 301-304. 
78- Miistein. C. (1969) Febs Letters. 2. 301-304. 

79 Miistein, CP. ft Deverson, E.V. (1974) EurJ.Biochem., 49. 377-391 . 

80 Moran. MJ.. Andris. J.S.. Matsumato. Y.-l., Capra, J.D. ft Hersh. E.M. (1993) Mol.lmmunol.. 
30. 1543-1551. 

81 Nakatani. T.. Nomura. N., Horigome, K., Ohtsuka. H. ft Noguchi. H. (1989) Biotech.. 7. 
805-810. 

82 Newkirk. M., Chen. P.P.. Carson. D.. Posnett, D. ft Capra. J.D. (1986) Mol.lmmunol.. 23. 
239-244. 

83 Newkirk. M.M.. Gram. H.. Heinrich. G.F.. Ostberg, L. Capra. J.D. Et Wasserman. R.L (1988) 
J.CIin.lnvest., 81. 1511-1518. 

84 Newkirk. M.M.. Mageed. R.A.. Jefferis. R.. Chen. P.P. Et Capra. J.D. (1987) J.Exp.Med.. 166. 
550-564. 

85 Olee. B.T.. Lu. E.W., Huang, D.-F.. Soto-Gil. R.W.. Deftos, M.. Kozin. F.. Carson. D.A. ft Chen. 
P.P. (1992) J.Exp.Med.. 175. 831-842. 

86 Palm, W. ft Hilschmann. N. (1973) Z.Physioi.Chem.. 354. 1651-1654; (1975) 
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Claims 



A method of setting up one or more nucleic acid sequences encoding one or 
more (poly)peptide sequences suitable for the creation of libraries of 
(poly)peptides said (poly)peplide sequences comprising amino acid 
consensus sequences, said method comprising the following steps: 

(a) deducing from a collection of at least three homologous proteins one 
or more (poly)peptide sequences comprising at least one amino acid 
consensus sequence; 

(b) optionally, identifying amino acids in said (poly)peptide sequences to 
be modified so as to remove unfavorable interactions between amino 
acids within or between said or other (poly)peptide sequences; 

(c) identifying at least one structural sub-element within each of said 
{poly)peptide sequences; 



a 



(d) backtranslating each of said (poly)peptide sequences into 
corresponding coding nucleic acid sequence; 

(e) setting up cleavage sites in regions adjacent to or between the ends of 
sub-sequences encoding said sub-elements, each of said cleavage 
sites: 

(ea) being unique within each of said coding nucleic acid sequences; 

(eb) being common to the corresponding sub-sequences of any said 
coding nucleic acids. 

A method of setting up two or more sets of one or more nucleic acid 
sequences comprising executing the steps described in claim 1 for each of 
said sets with the additional provision that said cleavage sites are unique 
between said sets. 

The method of claim 2 in which at least two of said sets are deduced from the 
same collection of at least three homologous proteins. 

The method according to any one of claims 1 to 3, wherein said setting up 
further comprises the synthesis of said nucleic acid coding sequences. 

The method according to any one of claims 1 to 4, further comprising the 
cloning of said nucleic acid coding sequences into a vector. 
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The ™mod according to any one o. claims 1 to 5 whe-eln removaj o. 
unfavorable interactions results in enhanced express«n ol 

(poly)peptides. 

7. The method according to an, one ol claims 1 lo 6, lunher comprising the 

tr "living at least two ol said cleavage sites located in regions adiacent 
to or between the ends ol said sub-sequences; and 

(g) exchanging said sub-sequences by different sequences; and 

(h) optionally, repeating steps (f) and (g) one or more times. 

The method according to claim 7, wherein said different ^''"^'-^Z 
selected irom the 9'oup of different sub-sequences encoding the same or 
different sub-elements derived from the same or different (poly)pept,des. 

The method according to claims 7 or 8. wherein said different sequences are 

selected from the group of: .pnres- 

(i) ger^omic sequences or sequences derived from g«"°'^'^;^.^"^;'^!^'_^ 

(ii) rearranged genomic sequences or sequences denved from 
rearranged genomic sequences; and 

(iii) random sequences. 

The method according to any one of claims 1 to 9 fuHher comprising the 
expression of said nucleic acid coding sequences. 

11 . The method according to any one ol claims 1 to 10 further comprising the 
T "Screening, after expression, the resultant (poly)peptides lor a desired 

(K) optS;. repeating steps (I) to (1) one or more «mes with nucleic acid 
sequences encoding one or more (poly)peptldes obtained ,n step 0). 

The method according to claim 11. wherein said desired property is selected 
^r m the group ol optimized alftnity or specilicily for a targe, molecul 
opLzed en.yma.ic activity, optimized expression yields, optlm.zed stab,l,ty 
and optimized solubility. 
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* I -^o i tr 1? wherein said cleavage 
13. The method according to any one of claims 1 tc 12. wn.re 

sites are sites cleaved by restriction enzymes. 

1 4 The method according » any one o, claims 1 .0 13, wherein said su.c.ur^ 
sub-elements comprise between 1 and 1 50 amino acids. 

,5; The method according .0 claim 14, wherein said structural sub-elements 

comprise between 3 and 25 amino acids. 
1 e. The method according to any one ot claims 1 to 15. wherein said nucleic acid 

is DNA. 

„ The method according to any one o. claims J,\rnrrtlcrr 
(poiv)peptldes have an amino acid pattern charactensfo of a partrcuiar 

species. 

18 The method according to claim 17. Wherein said species is human. 

immunoglobulin superfamily. 

.0. The method according to claim 19, wherein said 

,ne immunoglobulin superfamily are members or denvatrves 

immunoglobulin family. 

sub-elements are framework regions (FR) 1. 2. J. or f 
determining regions (CDR) 1, 2. or 3. 

22 The method according to claim 20 or 21. wherein said (poly)peptldes are or 

■ r^virvi^r.'^^rvrvHinH^^ 

CK. ex. CH1 or any combination of said HuCAL consensus genes. 

23 The method according to any one of claims 20 to 22. wherein said derivative 

■ of sa^ immunoglobulin family or said combination is an Fv. d.sulph.de-l.nked 
Fv single-chain Fv (scFv). or Fab fragment. 
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. The .emcd « .0 22 .0 as w.--^^^^^^ 

:=r:r ^00— ^^^^^^^^ - 

heavy chain CDR3 sub-element. 

«ioim« 1 to 24 wherein at least part of 
, The method according to any ,o a sequence 

said (poly)peptide sequences o (P^^'^l'' ,,,,.„nai moiety, 

encoding at least one additional mo.ety or to at least 

respectively. 

. • » OR »n c»R wherein said additional moiety is a 
The method according to claims 25 to 26. wherein 

homo- or hetero-association domain. 

« ^iaim«i 10 to 27 wherein the expression 
,8. The method according to any one of claims 10 to ^J- w 

of said nucleic acid sequences results .n the generation of a reperto. 

:!jo^ca. activities and/or spec«icities. preferably in the generation of a 

repertoire based on a universal framework. 
28. A nuolete acid sequence obtainable by me method according .o any of claims 

1 to 28. 

30. A collection of nucleic acid sequences cb,a«.able by the method acccrdinq to 

any of claims 1 to 28. 

3, . A recombinant vector .obtainable by the method according to any o. claims 5 
to 28. 

32. A collection o. recombinant vectors obtainable by the method according to 
any of claims 5 to 30. 

33. A host cell transfomted w«h the recombinant vector according to claim 31. 
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A collection of host cells transformed »i.h the colleoSor, o. rccon.b:nan, 

vectors according to claim 32. 

A method of producing a (poWpeptide or a collec«on of (P^W^P^f^^^ 
deled in any o, claims 1 to 28 comprising culturlng the ^^^^'^'""^^Z 
cL 33 or the collection of host cells according to claim 34 ""d^' "''abte 
Editions and. isolating said (pol,)pept,de or said collection of 
(poly)peptides. 

A (poly)peptide devisable by the method according to any one of claims 1 to 
by the method according to any one ol claims 4 to 28 or 35. 

„ Acollecion 0, (po,y)peptldes devisable by the method according to any one 
of claims 1 10 3. encoded by the collection of nucleic acd sequences 
M to Claim 30 or obtainable by the method according to any one of 
Claims 4 to 28 or 35. 

38 A vector suitable for use in the method according to any of claims 5 to 28 and 
' 35 chircterized in that said vector Is essentially devoid of any cleavage s.te 

as defined in claim 1(e) and 2. 
39 . The vector according to claim 38 which is an expression vector. 

40 A kit comprising at least one of; 

(a) a nucleic acid sequence according to claim 29; 

(b) a collection of nucleic acid sequences according to claim 30. 

(c) a recombinant vector according to claim 31; 

(d) a collection of recombinant vectors according to claim 32; 

(e) a (poly)peptide according to claim 36; 

(f) a collection of (poly)peptides according to claim 37; 

(g) a vector according to claim 38 or 39; and optionally. 

(,) a suitable host cell for carrying out the method according to claim 35. 



41 . Amethod of designing two or more genes encoding a collection of two or more 
proteins, comprising the steps of: 
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(a) either 

(aa) identifying two or more homologous gene sequences, or 

(ab) analyzing at least three homologous genes, and 
deducing two or more consensus gene sequences therefrom. 

(b) optionally, modifying codons in said consensus gene sequences 
to remove unfavourable interactions between amino acids in the 
resulting proteins, 

(c) identifying sub-sequences which encode structural sub- 
elements in said consensus gene sequences 

(d) modifying one or more bases in regions adjacent to or between 
the ends of said sub-sequences to define one or more cleavage 
sites, each of which: 

(da) are unique within each consensus gene sequence. 

(db) do not form compatible sites with respect to any single 
sub-sequence, 

(dc) are common to all homologous sub-sequences. 



42. A method of preparing two or more genes encoding a collection of two or more 
proteins, comprising the steps of : 

(a) designing said genes according to claim 41 . and 

(b) synthesizing said genes. 



43 . A collection of genes prepared according to the method of claim 42. 



44 . A collection of two or more genes derived from gene sequences which: 

(a) are either homologous, or represent consensus gene sequences 
derived from at least three homologous genes, and 
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(b) carry cleavage sites, each of which: 

(ba) lie at or adjacent to the ends of genetic sub-sequences which 
encode structural sub-elements. 

(bb) are unique within each gene sequence, 

(be) do not form compatible sites with respect to any single sub- 

sequence, and 
(bd) are common to all homologous sub-sequences. 

particular species. 

46. The oonecion o< genes accordi^ .o daim 45 in which said species is human. 
... The co.ec.ion o, ,enes ^^'^'^ :Zlt: ^^^^^^'^ 

48 The collection o, genes according .0 claim 47 in which -'^ — 
elements correspond .o any combina.ion of framework regions 1, 2, 3, 4, 
and/or CDB regions 1 , 2. and 3 of antibody heavy chains. 

49 The collection o, genes according to claim 47 In ffj^f; 
elements correspond to any combination o. framework regions 2, 3, 4, 
and/or CDR regions 1 . 2, and 3 of antibody light cha,ns. 



SO. A collection of vectors comprising a collection o, gene sequences according 
to any Of Claims 43 to 49. 
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51 The collection of vectors according to claim 50 comprising the addit.onal 
feature that the vector does not comprise any cleavage site that is contained 
in the collection of genes according to any of claims 43 to 49. 



52. A method for identifying one or more genes encoding one or more proteins 
having a desirable property, comprising the steps of: 

(a) expressing from the collection of vectors according to either of claims 
50 or 51 a collection of proteins. 

(b) screening said collection to isolate one or more proteins having a 
desired property, 

(c) identifying the genes encoding the proteins isolated in step (b), 

(d) optionally, excising from the genes encoding the proteins isolated in 
step (b) one or more genetic sub-sequences encoding structural sub- 
elements, and replacing said sub-sequence(s) by one or more second 
sub-sequences encoding structural sub-elements, to generate new 
vectors according to either of claims 50 or 51 . 

(e) optionally, repeating steps (a) to (c). 

53. A method for identifying one or more genes encoding one or more antibody 
fragments which binds to a target, comprising the steps of: 

(a) expressing from the collection of vectors according to either of claims 
50 or 51 a collection of proteins, 

(b) screening said collection to isolate one or more antibody fragments 
which bind to said target. 

(c) identifying the genes encoding the proteins isolated in step (b). 

(d) optionally, excising from the genes encoding the antibody fragments 
isolated in step (b) one or more genetic sub-sequences encoding 
structural sub-elements, and replacing said sub-sequence(s) by one or 
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more second sub-sequences encoding stmctural sub-ger.crate new 
vectors according to either of claims 50 or 51 . 

(e) optionally, repeating steps (a) to (c). 

54. A kit comprising two or more genes derived from gene sequences which: 

(a) are either homologous, or represent consensus gene sequences 
derived from at least three homologous genes, and 

(b) carry cleavage sites, each of which: 

(ba) lie at or adjacent to the ends of genetic sub-sequences which 
encode structural sub-elements, 

(bb) are unique within each gene sequence, 

(be) do not fomi compatible sites with respect to any single sub- 

sequence, and 
(bd) are common to all homologous sub-sequences. 



55 A kit comprising two or more genetic sub-sequences which encode structural 
sub-elements, which can be assembled to form genes, and which carry 
cleavage sites, each of which: 

(a) lie at or adjacent to the ends of said genetic sub-sequences, 

(b) do not form compatible sites with respect to any single sub-sequence, 
and 

(d) are common to all homologous sub-sequences. 
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Figure 1: construction 
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Figure 6: oligonucleotides for gene synthesis 

OlKl 5'- GAATGCATACGCTGATATCCAGATGACCCAGAG 
CCCGTCTAGCCTGAGC -3 ' 

01K2 5'- CGCTCTGCAGGTAATGGTCACACGATCACCCAC 
GCTCGCGCTCAGGCTAGACGGGC -3 ' 

01K3 5 ' - GACCATTACCTGCAGAGCGAGCCAGGGCATTAG 
CAGCTATCTGGCGTGGTACCAGCAG -3 ' 

01K4 5 ' - CTTTGCAAGCTGCTGGCTGCATAAATTAATAGT 
TTCGGTGCTTTACCTGGTTTCTGCTGGTACCACGCCAG -3 ' 

01K5 5 ' - CAGCCAGCAGCTTGCAAAGCGGGGTCCCGTCCC- 
GTTTTAGCGGCTCTGGATCCGGCACTGATTTTAC -3 ' 

01K6 5'- GATAATAGGTCGCAAAGTCTTCAGGTTGCAGGC- 
TGCTAATGGTCAGGGTAAAATCAGTGCCGGATCC -3 ' 

02 Kl 5 ' - CGATATCGTGATGACCCAGAGCCCACTGAGCCT- 
GCCAGTGACTCCGGGCGAGCC -3 ' 

02K2 5 ' - GCCGTTGCTATGCAGCAGGCTTTGGCTGCTTCT- 
GCAGCTAATGCTCGCAGGCTCGCCCGGAGTCAC -3 ' 

02K3 5'- CTGCTGCATAGCAACGGCTATAACTATCTGGAT- 
TGGTACCTTCAAAAACCAGGTCAAAGCCC -3 ' 

02 K4 5 ' - CGATCCGGGACCCCACTGGCACGGTTGCTGCCC- 
AGATAAATTAATAGCTGCGGGCTTTGACCTGGTTTTTG -3 ' 

02 K5 5 ' - AGTGGGGTCCCGGATCGTTTTAGCGGCTCTGGA- 
TCCGGCACCGATTTTACCCTGAAAATTAGCCGTGTG -3 ' 

02K6 5 ' - CCATGCAATAATACACGCCCACGTCTTCAGCTT- 
CCACACGGCTAATTTTCAGGG -3 ' 

03K1 5 ' - GAATGCATACGCTGATATCGTGCTGACCCAGAG- 
CCCGG -3 ' 

03K2 5 ' - CGCTCTGCAGCTCAGGGTCGCACGTTCGCCCGG- 
AGACAGGCTCAGGGTCGCCGGGCTCTGGGTCAGC -3 ' 

03K3 5 ' - CCCTGAGCTGCAGAGCGAGCCAGAGCGTGAGCA- 
GCAGCTATCTGGCGTGGTACCAG -3 ' 
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Figure 6: (continued) 

03K4 5'- GCACGGCTGCTCGCGCCATAAATTAATAGACGC- 
GGTGCTTGACCTGGTTTCTGCTGGTACCACGCCAGATAG -3 ' 

03K5 5'- GCGCGAGCAGCCGTGCAACTGGGGTCCCGGCGC- 

GTTTTAGCGGCTCTGGATCCGGCACGGATTTTAC -3 ' 

03K6 5'- GATAATACACCGCAAAGTCTTCAGGTTCCAGGC- 

TGCTAATGGTCAGGGTAAAATCCGTGCCGGATC -3 ' 

04K1 5'- GAATGCATACGCTGATATCGTGATGACCCAGAG- 

CCCGGATAGCCTGGCG -3 ' 

04K2 5'- GCTTCTGCAGTTAATGGTCGCACGTTCGCCCAG- 
GCTCACCGCCAGGCTATCCGGGC -3 ' 

04K3 5 ' - CGACCATTAACTGCAGAAGCAGCCAGAGCGTGC- 
TGTATAGCAGCAACAACAAAAACTATCTGGCGTGGTACCAG - 

3 • 

04K4 5 ' - GATGCCCAATAAATTAATAGTTTCGGCGGCTGA- 

CCTGGTTTCTGCTGGTACCACGCCAGATAG -3 ' 

04K5 5'- AAACTATTAATTTATTGGGCATCCACCCGTGAA- 

AGCGGGGTCCCGGATCGTTTTAGCGGCTCTGGATCCGGCAC- 

3 ' 

04K6 5'- GATAATACACCGCCACGTCTTCAGCTTGCAGGG- 
ACGAAATGGTCAGGGTAAAATCAGTGCCGGATCCAGAGCC 

3 ' 

OlLl 5'- GAATGCATACGCTCAGAGCGTGCTGACCCAGCC- 
GCCTTCAGTGAGTGG -3 ' 

01L2 5 ' - CAATGTTGCTGCTGCTGCCGCTACACGAGATGG 
TCACACGCTGACCTGGTGCGCCACTCACTGAAGGCGGC -3 ' 
01L3 5'- GGCAGCAGCAGCAACATTGGCAGCAACTATGTG 
AGCTGGTACCAGCAGTTGCCCGGGAC -3' 
01L4 5'- CCGGCACGCCTGAGGGACGCTGGTTGTTATCAT 
AAATCAGCAGTTTCGGCGCCGTCCCGGGCAACTGC - 3 ' 
01L5 5'- CCCTCAGGCGTGCCGGATCGTTTTAGCGGATCC 
AAAAGCGGCACCAGCGCGAGCCTTGCG -3' 
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Figure 6: (continued) 

01L6 5 ' - CCGCTTCGTCTTCGCTTTGCAGGCCCGTAATCG 
CAAGGCTCGCGCTGG -3 ' 

02L1 5'- GAATGCATACGCTCAGAGCGCACTGACCCAGCC 
AGCTTCAGTGAGCGGC -3 ' 

02L2 5 ' - CGCTGCTAGTACCCGTACACGAGATGGTAATGC 
TCTGACCTGGTGAGCCGCTCACTGAAGCTGG -3 ' 
02 L3 5 ' - GTACGGGTACTAGCAGCGATGTGGGCGGCTATA 
ACTATGTGAGCTGGTACCAGCAGCATCCCGG -3 ' 

02 L4 5 ' - CGCCTGAGGGACGGTTGCTCACATCATAAATCA' 
TCAGTTTCGGCGCCTTCCCGGGATGCTGCTGGTAC -3 ' 

02L5 5 • - CAACCGTCCCTCAGGCGTGAGCAACCGTTTTAG' 
CGGATCCAAAAGCGGCAACACCGCGAGCC -3 ' 
02L6 5 • - CCGCTTCGTCTTCCGCTTGCAGGCCGCTAATGG- 
TCAGGCTCGCGGTGTTGCCG -3' 

03L1 5 • - GAATGCATACGCTAGCTATGAACTGACCCAGCC- 
GCCTTCAGTGAGCG -3' 

03L2 5 ' - CGCCCAGCGCATCGCCGCTACACGAGATACGCG- 

CGGTCTGACCTGGTGCAACGCTCACTGAAGGCGGC -3 ' 

03 L3 5 ' - GGCGATGCGCTGGGCGATAAATACGCGAGCTGG- 

TACCAGCAGAAACCCGGGCAGGCGC -3 ' 

03 L4 5 ' - GCGTTCCGGGATGCCTGAGGGACGGTCAGAATC- 

ATCATAAATCACCAGAACTGGCGCCTGCCCGGGTTTC -3 ' 

03 L5 5'- CAGGCATCCCGGAACGCTTTAGCGGATCCAACA- 
GCGGCAACACCGCGACCCTGACCATTAGCGG -3 ' 

03L6 5 ' - CCGCTTCGTCTTCCGCCTGAGTGCCGCTAATGG- 
TCAGGGTC -3 ' 

0124 6H1 5'- GCTCTTCACCCCTGTTACCAAAGCCCAG 
GTGCAATTG -3 ' 

01AH2 5 ' - GGCTTTGCAGCTCACTTTCACGCTGCTGCCCGG 
TTTTTTCACTTCCGCGCCAGACTGAACCAATTGCACCTGGGC- 

TTTG - 3 ' 
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Figure 6: (continued) 

01AH3 5 ' - GAAAGTGAGCTGCAAAGCCTCCGGAGGCACTTT- 
TAGCAGCTATGCGATTAGCTGGGTGCGCCAAGCCCCTGGGCAG 

GGTC -3 ' 

01AH4 5 • - GCCCTGAAACTTCTGCGCGTAGTTCGCCGTGCC- 
AAAAATCGGAATAATGCCGCCCATCCACTCGAGACCCTGCCC- 

AGGGGC -3' 

01AH5 5 ' - GCGCAGAAGTTTCAGGGCCGGGTGACCATTACC- 
GCGGATGAAAGCACCAGCACCGCGTATATGGAACTGAGCAGCC 

TGCG -3 ' 

01ABH6 5 ' - GCGCGCAATAATACACGGCCGTATCTTCGCT- 
ACGCAGGCTGCTCAGTTCC -3' 

01BH2 5 ' - GGCTTTGCAGCTCACTTTCACGCTCGCGCCCGG- 
TTTTTTCACTTCCGCGCCGCTCTGAACCAATTGCACCTGGGC- 

TTTG -3 ' 

01BH3 5 ' - GAAAGTGAGCTGCAAAGCCTCCGGATATACCTT- 
TACCAGCTATTATATGCACTGGGTCCGCCAAGCCCCTGGGCAG 

GGTC -3 ' 

01BH4 5 ' - GCCCTGAAACTTCTGCGCGTAGTTCGTGCCGCC- 
GCTATTCGGGTTAATCCAGCCCATCCACTCGAGACCCTGCCCA 

GGGGC -3' 

01BH5 5 ' - GCGCAGAAGTTTCAGGGCCGGGTGACCATGACC- 
CGTGATACCAGCATTAGCACCGCGTATATGGAACTGAGCAGCC 

TGCG -3 • 

02H2 5'- GGTACAGGTCAGGGTCAGGGTTTGGGTCGGTTT- 
CACCAGGGCCGGGCCGCTTTCTTTCAATTGCACCTGGGCTTTG 

-3 • 

02H3 5'- CTGACCCTGACCTGTACCTTTTCCGGATTTAGC- 
CTGTCCACGTCTGGCGTTGGCGTGGGCTGGATTCGCCAGCCGC 

CTGGGAAAG - 3 ' 

02H4 5'- GCGTTTTCAGGCTGGTGCTATAATACTTATCAT- 
CATCCCAATCAATCAGAGCCAGCCACTCGAGGGCTTTCCCAGG 

CGGCTGG -3' 
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Figure 6: (continued) 

02H5 5'- GCACCAGCCTGAAAACGCGTCTGACCATTAGCA- 
AAGATACTTCGAAAAATCAGGTGGTGCTGACTATGACCAACAT 

GG -3' 

02H6 5'- GCGCGCAATAATAGGTGGCCGTATCCACCGGGT- 
CCATGTTGGTCATAGTCAGC -3 ' 

03H1 5'- CGAAGTGCAATTGGTGGAAAGCGGCGGCGGCCT- 
GGTGCAACCGGGCGGCAG -3 ' 

03H2 5 ' - CATAGCTGCTAAAGGTAAATCCGGAGGCCGCGC- 
AGCTCAGACGCAGGCTGCCGCCCGGTTGCAC -3 ' 
03H3 5 ' - GATTTACCTTTAGCAGCTATGCGATGAGCTGGG- 
TGCGCCAAGCCCCTGGGAAGGGTCTCGAGTGGGTGAG -3 ' 

03H4 5'- GGCCTTTCACGCTATCCGCATAATAGGTGCTGC- 
CGCCGCTACCGCTAATCGCGCTCACCCACTCGAGACCC -3 ' 

03H5 5'- CGGATAGCGTGAAAGGCCGTTTTACCATTTCAC- 
GTGATAATTCGAAAAACACCCTGTATCTGCAAATGAACAG- 3 ' 

03H6 5'- CACGCGCGCAATAATACACGGCCGTATCTTCCG- 
CACGCAGGCTGTTCATTTGCAGATACAGG -3 ' 
04H2 , 5 ■ - GGTCAGGCTCAGGGTTTCGCTCGGTTTCACCAG- 
GCCCGGACCACTTTCTTGCAATTGCACCTGGGCTTTG -3 ' 

04H3 5'- GAAACCCTGAGCCTGACCTGCACCGTTTCCGGA- 
GGCAGCATTAGCAGCTATTATTGGAGCTGGATTCGCCAGCCGC 

-3 ' 

04H4 5 ' - GATTATAGTTGGTGCTGCCGCTATAATAAATAT- 
AGCCAATCCACTCGAGACCCTTCCCAGGCGGCTGGCGAATCCA 

G -3' 

04H5 5'- CGGCAGCACCAACTATAATCCGAGCCTGAAAAG- 
CCGGGTGACCATTAGCGTTGATACTTCGAAAAACCAGTTTAGC 

CTG -3 ' 

04H6 5'- GCGCGCAATAATACACGGCCGTATCCGCCGCCG- 
TCACGCTGCTCAGTTTCAGGCTAAACTGGTTTTTCG -3 ' 
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Figure 6: (continued) 

05H1 5 ' - GCTCTTCACCCCTGTTACCAAAGCCGAAGTGCA- 
ATTG -3 ' 

05H2 5 ' - CCTTTGCAGCTAATTTTCAGGCTTTCGCCCGGT- 
TTTTTCACTTCCGCGCCGCTCTGAACCAATTGCACTTCGGCTT 
TGG -3 ' 

05H3 5 ■ - CCTGAAAATTAGCTGCAAAGGTTCCGGATATTC- 
CTTTACGAGCTATTGGATTGGCTGGGTGCGCCAGATGCCTGG 
-3 • 

05H4 5 ' - CGGAGAATAACGGGTATCGCTATCGCCCGGATA- 
AATAATGCCCATCCACTCGAGACCCTTCCCAGGCATCTGGCGC 
AC -3 ' 

05H5 5 ' - CGATACCCGTTATTCTCCGAGCTTTCAGGGCCA- 
GGTGACCATTAGCGCGGATAAAAGCATTAGCACCGCGTATCTT 
C -3' 

05H6 5 ' - GCGCGCAATAATACATGGCCGTATCGCTCGCTT- 
TCAGGCTGCTCCATTGAAGATACGCGGTGCTAATG -3 ' 

06H2 5 ' - GAAATCGCACAGGTCAGGCTCAGGGTTTGGCTC- 
GGTTTCACCAGGCCCGGACCAGACTGTTGCAATTGCACCTGG- 
GCTTTG -3 ' 

06H3 5 ' - GCCTGACCTGTGCGATTTCCGGAGATAGCGTGA- 
GCAGCAACAGCGCGGCGTGGAACTGGATTCGCCAGTCTCCTGG 
GCG -3 ' 

06H4 5 ' - CACCGCATAATCGTTATACCATTTGCTACGATA- 
ATAGGTACGGCCCAGCCACTCGAGGCCACGCCCAGGAGACTG- 
GCG -3 ■ 

06H5 5 ' - GGTATAACGATTATGCGGTGAGCGTGAAAAGCC- 
GGATTACCATCAACCCGGATACTTCGAAAAACCAGTTTAGCCT 
GC -3 ' 

06H6 5 ' - GCGCGCAATAATACACGGCCGTATCTTCCGGGG- 
TCACGCTGTTCAGTTGCAGGCTAAACTGGTTTTTC -3 ' 

OCLKl 5 ' - GGCTGAAGACGTGGGCGTGTATTATTGCCAGCA- 

GCATTATACCACCCCGCeGACCTTTGGCCAGGGTAC - 3 ' 
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Figure 6: (continued) 

OCLK2 5 ' - GCGGAAAAATAAACACGCTCGGAGCAGCCACCG- 
TACGTTTAATTTCAACTTTCGTACCCTGGCCAAAGGTC -3 ' 

0CLK3 5 • - GAGCGTGTTTATTTTTCCGCCGAGCGATGAACA- 
ACTGAAAAGCGGCACGGCGAGCGTGGTGTGCCTGCTG -3 ' 

OCLK4 5 ' - CAGCGCGTTGTCTACTTTCCACTGAACTTTCGC- 
TTCACGCGGATAAAAGTTGTTCAGCAGGCACACCACGC -3 ' 

0CLK5 5 ' - GAAAGTAGACAACGCGCTGCAAAGCGGCAACAG- 
CCAGGAAAGCGTGACCGAACAGGATAGCAAAGATAG -3 ' 

OCLK6 5 ' - GTTTTTCATAATCCGCTTTGCTCAGGGTCAGGG- 
TGCTGCTCAGAGAATAGGTGCTATCTTTGCTATCCTGTTCG ■ 

3 • 

0CLK7 5 ' - GCAAAGCGGATTATGAAAAACATAAAGTGTATG- 
CGTGCGAAGTGACCCATCAAGGTCTGAGCAGCCCGGTG -3 ' 
0CLK8 5 ' - GGCATGCTTATCAGGCCTCGCCACGATTAAAAG- 
ATTTAGTCACCGGGCTGCTCAGAC -3 ' 

OCHl 5'- GGCGTCTAGAGGCCAAGGCACCCTGGTGACGGT- 
TAGCTCAGCGTCGAC -3 ' 

0CH2 5 ' - GTGCTTTTGCTGCTCGGAGCCAGCGGAAACACG- 
CTTGGACCTTTGGTCGACGCTGAGCTAACC -3 ' 
0CH3 5'- CTCCGAGCAGCAAAAGCACCAGCGGCGGCACGG- 
CTGCCCTGGGCTGCCTGGTTAAAGATTATTTCC -3 ' 
OCH4 5'- CTGGTCAGCGCCCCGCTGTTCCAGCTCACGGTG 
ACTGGTTCCGGGAAATAATCTTTAACCAGGCA -3 ' 
0CH5 5'- AGCGGGGCGCTGACCAGCGGCGTGCATACCTTT 
CCGGCGGTGCTGCAAAGCAGCGGCCTG -3 ' 
OCH6 5'- GTGCCTAAGCTGCTGCTCGGCACGGTCACAACG 
CTGCTCAGGCTATACAGGCCGCTGCTTTGCAG -3 ' 
0CH7 5'- GAGCAGCAGCTTAGGCACTCAGACCTATATTTG 
CAACGTGAACCATAAACCGAGCAACACC -3 ' 
0CH8 5'- GCGCGAATTCGCTTTTCGGTTCCACTTTTTTAT 
CCACTTTGGTGTTGCTCGGTTTATGG -3 ' 
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Figure 253: List of unique restriction sites used in or suitable for HuCAL genes or pCAL vectors 



uniqu e rest riction site 
AatlT 



Afili 



BamHI 



Bbel 



Bbsl 



Bglll 



BIpl 



BsaBI 



Isoschizomers 
I 



Bfrl, BspTi. Bst98l 



L 

Vspl, Asnl. PshBI 



BstI 



Ehel, Kasi, Narl 



BpuAl, Bpil 



/ 



Bpu1102I.Ceill. BIpl 



Maml. Bsh1365l, BsrBRI 



BsiWI 



Pfl23ll, Spli. Sun 



BspEI 



BsrGI 



Acclll. BseAI. BsiMI. Kpn2l, Mrol 



Bsp1407l, SspBI 



BssHII 



BstEII 



BstXI 



Bsu36l 



Drain 



Paul 



BstPI, E co91l. Eco0651 

7 



AocI, Cvnl, EcoSr 



/ 



DsmAI 



EagI 
Eco57l 
Eco0109l 
EcoRI 
EcoR V 
Fsel 
Hindlll 
Hpal 
Kpnl 
Mlul 
MscI 



BstZI. EclXl, Ec o52l,Xmalll 
/ 

Drall_ 

i 

Eco32l 



i 
I 
I 

Acc65l. Asp_718l^ 

_ I 
Ball, MluNI 



SUBSTITU^IffllT(RULE26) 



wo 97/08320 



PCT/EP96/03647 



Figure 253: List of unique restriction 
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Figure 28: functional map and sequence of pMCS cloning vector 
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Figure 35b: List of oligonucleotides used for synthesis of modules 

U\V?_^^ . icing template 
NoVspAatll:TAGACGTC 

|y/i9- ^i ynthesis 

BloxA-A:TAT6A6ATaCATAACTrC6TATAAT6TACGCTATACG- 

CaItaataacttcgtatagcatacattatacgaagttatg- 
agatctca 

VCR. N ^^'p'^"^" ''^''""^ "'i'^'^ 
XloxS-muta:CATTTTTrGCCCTCGnATCTACGCATGCGATAACTTCGTA- 

TAGCGTACATTATACGAAGTTATTCTAGACATGGTCATAGCTGTnCCTG 

^lSS.w:GGG6GGAATTCG6T6GTGGT66ATCTGCGTGCGCTG- 
AAACGGTTGAAAGTTG 

glllNEW-rev:CCCCCCCAAGCrrATCAA6ACTCCTTATTACG 
M7-II: PCR 

g,llss-fow:GGGGGGGGAATTCGGAGGCGGTTCCGGTGGT6GC 

^Jl!^w-fow: GGGGGGGGAATTCGAGCAGAAGaGATCTCT- 

SGTGGCTCTGGTTCCGGTGATTTTG 
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Figure 35b: List of oligonucleotides used for synthesis of modules (continued) 

Mft! s ynthesis 

lox514-A:CCATAAOTCGTATAATGTACGCTATACGAA6TTATA 
lox5U-B:AGCnATAACnCGTATA6C6TACATTATACGAAGT- 

TATG6CAT6 
MQII- s ynthesis 

M9ll-fow: AGCnGACCTGTGAAGTGAAAAATGGCQCAGAn- 

GTGCGACATTTTTrfTGTCTGCCGTTTAAnAAAGGGGGGGT 

M9ll-rev:GTACACCCCCCCCCAGGCCGGCCCCCCCCCCCCTTTAA- 

nAAACGGCAGACAAAAAAAATGTCGCACAATCTGCG 

Minn- assembly ^'■^'^ template 
bla-fow.GGGGGGGTGTACATTCAAATATGTATCCGCTCATG 

bla-seq4: GGGTTACATCGAACT66ATCTC 

blal -muta: CCAGITCGATGTAACCCACTCGCGCACCCAACTGATC- 

CTCAGCATCTTTFACTnCACC 

blall-muta-.ACTCTAGOTCCCGGCAACAGTFAATAGACTGGATG- 
GA6GCGG 

bia-NEW- CTGTTGCCGGGAAGCTAGAGTAAG 
bla-rev:CCCCCCCTrAATTAAGGGGGGGG6CCGGCCATTATCAAA- 

AAGGATCTCAAGAAGATCC 
Mnil/HltPC P c-.tP-Hirpcted mutagenesis 
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fl-rev CCCCCCaGTACATGAAATTGTAMCGTrAATATmG 
f1-t133.muta: GGGCGATGGCCCAaACGAGAACCATCACCCTAATC 



aw mbl y P^" "^'m tpmplate 
pi 5-f ow: 6GGGG6AG AiaAATAAGATGATCTTCTTGAQ 
pi 5-NEWI: GAGTTG6TAGCTCAGAGAACCTACGAAAAACCGCCCTG- 

CAAGGCG 

pIB-NEWll: GTAGGTTCTCTGAGCTACCAACTC 

pi 5-NEWlll: GTHCCCCaGGCGGCTCCacaGCGaCTCCTGlTCCT- 

GCC 

pi 5-NEWlV: AGGAGGGAGCCGCCAGG6GGAAAC 
pi 5-rev: GACATCAGCGaAGCGGAGTGTATAC 

MIT ■; vnthesis 

BloxXB-A:6ATCTCATAACTTCGTATAATGTATGCTATACGAAGnA- 
TTCA 

BloxXB-B: 6ATCTGAATAACTTCGTATAGCATACATTATACGAAGTTA- 
TGAGA 

|vM^.Fvt9- PCR. Sitr ftir-f"< r^-taqenesis 
ColEXT2-fow:GGGGG6GAGATCTGACCAAAATCCCTTAACGTGAG 

Col-mutal:GGTATaGCGCTaGaGTAGCCA6TTACCrTCGG 
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Figure35b:Listof Oligonucleotides used for synthesisofm^^^^^^^^^ 

Col-rev: CCCCCCCGCTA6CCATGTGA6CAAAAG6CCA6CAA 



^yl17» a<:«;pmbl Y P^^" "^'"T tpmplate 
CAT-1 : GGGACGTCGGGTGAGGTTCCAAC 
CAT-2: CCATACGGAACTCCGGGTGA6CATTCATC 
CAT-3: CCGGAGTTCCGTATGG 
CAT-4: ACGTTTAAATCAAAACTGG 

CAT-5: CCAGirnGAmAAACGTAGCCAATATGGACAACncnC- 

GCCCCCGTnrCACTATGGGCAAATATT 

CAT-6: GGAAGATCTAGCACCAGGCGTTTAAG 

MAI ! assemb ly P^R '"^'"1 template 

LAC1 : GAGGCCGGCCATCGAATGGCGCAAAAC 

LAC2- CGC6TACCGTCCTCATGGGAGAAAATAATAC 

LAC3:CCATGAGGACGGTACGCGACTGGGCGTGGAGCATCTGGTCGCA- 

TTGGGTCACCAGCAAATCCGCTGTTAGCTGGCCCATTAAG 
IAC4:GTCAGCGGCGGGATATAACATGAGCTGTCCTCGGTATCGTCG 

LAC5- GTTATATCCCGCCGCTGACCACCATCAAAC 
LAC6:CATCAGTGAATCGGCCAACGCGCGGGGAGAGGCGGmGCGT4TT6- 

GGAGCCAGGGTGGTmrC ^.^^-p^pp 
lAC7:6GnAAnAACCTCACTGCCCGCmCCAGTCGGGAAACCTGTCGTGC^^ 

A6CTGCATCAGTGAATCGGCCAAC 

M41 -MCS-fow: CTAG ACTAGTGTITAAACCGG ACCGGGGG6GG6C1T- 
AAGGGGGGGGGGGG substitute sheet (RUiE 23} 
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Figure 35b: List of oligonucleotides used ^^'^^'l';:'!'^^^^^^ 

M41-MCS-rev:CTAGCCCCCCCCCCCCTTAAGCCCCCCCCCGGTCCbbl 
TTAAACACTAGT 

M41-fow:CTAGACTAGT6TTrAAACCGGACCGGGGGGGGGCTrAA- 
GGGGGGGGGGGG 

M41 -rev: CCCCCCCTFAAGTGGGCTGCAAAACAAAACGGCCTCC- 
TGTCAGGAAGCCGCTTTTATCGGGTAGCCTCACTGCCCGCTnCC 
M41 -A2: GTTGITGTGCCACGCGGTTAGGAATGTAATTCAGCTCCGC 
M41-B1 : AACCGCGTGGCACAACAAC 

M41 -B2: C1TC6TCTACCATCGACACGACCACGCTGGCACCCAGTTG 

M41 -C1 : GTGTCGATGGTAGAACGAAG 
M41-CII:CCACAGCAATAGCATCCTGGTCATCCAGCGGATAGTT- 

AATAATCAGCCCACTGACACGTTGCGCGAG 

M41-DI: GACCAGGATGCTATTGCTGTGG 
M41-DII:CAGCGCGATnGCTGGTGGCCCAATGCGACCAGATGC 

M41-EI: CACCAGCAAATCGCGaG 

M41-Ell:CCC6GACTC6GTAATGGCACGCATrGCGCCCAGCGCC 
M41 -Fl: GCCATTACCGAGTCCGGG 

^/^A9• s ynthesis 

Eco-H5-Hmd-fow.AATTCCACCATCATCACCAn6AC6TCTA 
Eco-H5-Hind-rev:AGCTTAGAC6TCAAT66TGATGATe6TG6 
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Figure 37: Oligo and primer design for Vk CDR3 libraries 
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Figure 37: Oligo and primer design for VkCDR3 libraries 
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Figure 37: Oligo and primer design for Vk CDR3 libraries 
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Figure 37: Oligo and primer design for Vk CDR3 libraries 
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Figure 38: Oligo and primer design for VX CDR3 libraries 
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Figure 38: Oligo and primer design for VX. CDR3 libraries 
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Figure 38: Oligo and primer design for Vk CDR3 libraries 
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Figure 38: Oligo and primer design for VX CDR3 libraries 
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